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HUMAN VPS4A AND VPS4B FUNCTIONS IN VIRAL BUDDING 

L This application claims the benefit of and priority to United States Provisional Application No. 
60/381,027 filed on May 15, 2002 for Human Vps4A and Vps4B Functions in Viral Budding, United 
States Provisional Application No. 60/382,702 filed on May 21, 2002 for Human Vps4A and Vps4B 
5 Functions in Viral Budding, and United States Provisional Application No. 60/451,146 filed on February 
28, 2003 for Human Vps4A and Vps4B Functions in Viral Budding. These applications are specifically 
herein incorporated by reference in their entireties. 

I. ACKNOWLEDMENTS 

2. This invention was made with government support under federal grants AI51174 awarded by 
10 the NIH. The Government has certain rights to this invention. 

II. BACKGROUND OF THE INVENTION 

3. Human TsglOl plays key roles in HIV budding and in cellular vacuolar protein sorting (Vps). 
In performing these functions, TsglOl binds both ubiquitin (Ub) and the PTAP tetrapeptide "late 
domain" motif located within the viral Gag protein. These interactions are mediated by the N-terminal 

15 domain of TsglOl, which belongs to the catalytically inactive ubiquitin E2 variant (UEV) family. 

4. Human TsglOl has recently been identified as the functional receptor required for budding of 
the enveloped human immunodeficiency (HIV) and Ebola viruses (VerPlank et al., 2001; Garrus et ah, 
2001; Martin-Serrano et al., 2001, all of which are herein incorporated by reference at least for material 
related to TSG101). TsglOl is recruited to the sites of virus budding by binding to a PTAP tetrapeptide 

20 motif (the "late domain") located within the p6 region of HIV Gag and the Ebola Vp40 matrix structural 
proteins. Once there, TsglOl appears to recruit other cellular factors that help to complete the budding 
process (Garrus et al, 2001; Martin-Serrano et al., 2001). In the absence of TsglOl, the final membrane 
fission step(s) fail and HIV release is arrested at a very late stage, in which the assembled viral particles 
remain attached to the plasma membrane (and to one another) via thin membrane stalks (Garrus et al., 

25 2001). 

5. Disclosed herein are relationships and interactions related to TSG101 and Vps4A and 4B that 
are integral for HIV budding. 

III. SUMMARY OF THE INVENTION 

6. In accordance with the puiposes of this invention, as embodied and broadly described herein, 
30 this invention, in one aspect, relates to the HIV budding. 

7. Additional advantages of the invention will be set forth in part in the description which 
follows, and in part will be obvious from the description, or may be learned by practice of the invention. 
The advantages of the invention will be realized and attained by means of the elements and combinations 
particularly pointed out in the appended claims. It is to be understood that both the foregoing general 

35 description and the following detailed description are exemplary and explanatory only and are not 
restrictive of the invention, as claimed. 
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IV. BRIEF DESCRIPTION OF THE DRAWINGS 

8. The accompanying drawings, which are incorporated in and constitute a part of this 

specification, illustrate several embodiments of the invention and together with the description, serve to 

explain the principles of the invention. 

5 9. Figure 1 shows human TsglOl Binds HIV-1 p6 (A) HIV-1 Gag domain structure and 

dissociation constants for pure TsglOl UEV protein binding to immobilized HIV-1 GST-p6 and GST-p6- 

Ub proteins. The standard deviation in the dissociation constant for the full length, wild type p6 protein 

was derived from 16 independent measurements; estimated errors in the final significant digits of the 

other dissociation constants (parentheses) were derived from statistical fits of individual isotherms to 1:1 

10 binding models. (B) SDS-PAGE analysis showing induced expression in E, coli and fully purified 

TsglOl UEV. (C) Surface plasmon resonance biosensor analysis of the p6/Tsg 101 interaction. TsglOl 

UEV was injected in triplicate at concentrations of 0, 0.9, 0.27, 0.82, 2.5, 7.4, 22, 67, and 200 jiM over 

GST-p6 captured on an anti-GST surface. Data were double referenced to correct for bulk refractive 

index changes and any nonspecific binding (Myszka, 1999). (D) Representative binding isotherms for 

15 TsglOl UEV interacting with a series of different protein surfaces. (E) Epitope mapping of the TsglOl 
UEV binding site on HIV-1 p6. The graph shows the changes in free energy of TsglOl UEV binding 
(AAG) caused by single alanine substitutions in HIV-1 p6 residues 4-13 (dark bars) or by the P7L, T8I, 
A9R, and P10L mutations (light bars). Positive values indicate that the mutation reduces binding 
affinity, and error bars show estimated standard deviations in single binding isotherms. Broken lines 

20 above the P10L mutant denote a lack of detectable TsglOl UEV binding (see also Fig. 7D). 

1 0. Figure 2 shows TsglOl is Required for Release of HIV-1 from 293T Cells (A) Anti-TsglOl 
Western blot showing depletion of TsglOl protein from 293T cells by siRNA and restoration of TsglOl 
protein using siRNA-resistant expression constructs. Cells were co-transfected twice (2 rnL cultures, 6 
well plates) with plasmid DNA and with 50 nM siRNA or siRNA| NV at 24h intervals (except mock, lane 

25 1 , which received no RNA or DNA). The first siRNA/siRNAiwv co-transfection was with 2fig or 4^g of 
a Tsg-FLAG (siRNA-sensitive) or Tsg*-FLAG (siRNA-resistant) expression construct. The second 
siRNA/siRNAiNv co-transfection was with 0.5|ig HIV-i R9 DNA. In each case, total transfected DNA 
was normalized to 4.5 jag with pIRES2-EGFP. Cytoplasms and supernatants were analyzed 48h after the 
first transfection. (B) Anti-FLAG Western blot showing that the TsglOl *-FLAG construct is resistant to 

30 siRNA. Samples were prepared as in (A). (C) Anti-MA and anti-CA Western blot showing that depletion 
of TsglOl inhibits virus release. From cells transfected as in (A), viral particles were pelleted from the 
supernatant of the transfected cells. Virus release in untreated cells was similar to that in the siRNA INV - 
treated cells (not shown). (D) Anti-MA and anti-CA Western blot showing that cytoplasmic levels of 
HIV-1 Gag, CA and MA proteins are not affected by TsglOl depletion. Note that levels of MA and CA 

35 in cellular membrane fractions increased significantly when virus release was blocked (not shown). (E) 
Viral replication assays showing that release of infectious virus particles is inhibited when TsglOl is 
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depleted and rescued when TsglOl is restored. Titers of infectious HIV-1 particles released into the 
supernatant (shown) were quantitated in single cycle MAGIC assays. 

11. Figure 3 shows the depletion of TsglOl Does Not Affect MLV Release or Infectivity. (A) 
Depletion of TsglOl does not significantly inhibit MLV release from 293T cells. Upper panel: Anti- 
5 TsglOl and anti-14-3-3p Western blot showing specific depletion of TsglOl from 293T cells by siRNA. 
Depletion experiments were performed as described in Fig. 8A, except that the second siRNA co- 
transfection was with 1.5 jug pCLeco and 0.25 jug pCLlacZ MLV vector DNA. Levels of 14-3-3(3 protein 
are shown as loading controls, and a pCLeco vector expressing PPPY'mutant Gag is shown as a positive 
control for arrested budding (lane 7). Middle panel: Anti-MLV CA Western blot showing that virus 

10 release is not significantly reduced by TsglOl depletion. Lower panel: Anti-CA Western blot showing 

cytoplasmic levels of MLV Gag._(B) Depletion of TsglOl does not significantly inhibit MLV infectivity. 
Infectious MLV particles released into the supernatant were quantitated in single cycle infectivity assays 
and normalized to the appropriate controls. In repetitions of these experiments, siRNA treatment reduced 
particle release and infectivity slightly (up to two-fold) relative to the siRNAjnv control. 

15 12. Figure 4 shows a Mutant Vps4 Proteins Dominantly Inhibit HIV-1 Release and Infectivity 

(A) Mutant Vps4 proteins dominantly inhibit HIV-1 release. 293T cells were transfected with 0.5 jug R9 
plasmid encoding wt HIV-1 (lanes 2, 4-10) or a p6 PTAP" mutant HIV-1 (lane 3). Cells were co- 
transfected with pEGFP (1 |ig, lane 2), GFP-Vps4 wt (1 jag, lane 4), GFP-Vps4 K i73G (0.25, 0.5 or 1 fig; 
lanes 5-7), or GFP-Vps4 E2 28Q (0.25, 0.5 or 1 fig; lanes 8-10). Total DNA was normalized to 1.5 p,g with 

20 pEGFP. Pelleted virus (above) and cytoplasmic extracts (labeled Cell, below) were harvested after 24 h, 
and levels of expressed CA, MA, and GFP-Vps4 proteins analyzed by Western blotting. (B) Mutant 
Vps4 proteins dominantly inhibit HIV-1 infectivity. Levels of infectious HIV-1 particles released into 
the supernatant were quantitated using single cycle MAGIC assays. Samples were from (A). (C) Mutant 
Vps4 proteins dominantly inhibit release of Gag-GFP. 293T cells were transfected with 1 \xg vector 

25 expressing wt Gag-GFP (lanes 2, 4-6) or a p6 PTAP mutant Gag-GFP (lane 3). Samples were co-trans fected 
with 1 )jg vector expressing GFP only (lane 2), wt GFP-Vps4 (lane 4), GFP-Vps4 K i73Q (lane 5), or GFP- 
Vps4 E 228Q (lane 6). Pelleted virus (above) and cytoplasmic extracts (below) were harvested after 24 h, and 
Gag and GFP-Vps4 expression were analyzed by Western blotting with anti-MA, anti-CA (top two panels) 
and anti-GFP antibodies (bottom panel). (D) PTAP- mutation and dominant negative Vps4 proteins inhibit 

30 release of infectious HIV-1 from MT4 T cells. Cells were transfected (24 well plates) with 1 fig HIV-1 R9 
DNA and 1 ug Vps4 DNA or 1 fig control DNA (pEGFP). Levels of infectious HIV-1 particles released into 
the supernatant were quantitated using single cycle MAGIC assays 72 h post transfection. 

1 3. Figure 5 shows Mutant Vps4 Proteins Dominantly Inhibit MLV Release and Infectivity. (A) 
293T cells were co-transfected with 1 .5 |Lig pCLeco and 0.25 jug pCLlacZ MLV vectors encoding wt 

35 (lanes 2,4-6) or PPPY" mutant MLV Gag proteins (lane 3). Co-transfections were with 1 [ig of the 

following vectors: pEGFP expressing GFP only (lanes 2, 3), wt GFP-Vps4 (lane 4), GFP-Vps4 K i73Q (lane 
5), or GFP-Vps4 E2 28Q (lane 6). Pelleted virus (upper panel) and cytoplasmic extracts (lower panels) were 
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harvested after 24 h, and levels of expressed MLV CA, Gag, and GFP-Vps4 proteins were analyzed by 
Western blotting with anti-MLV CA and anti-GFP antibodies as indicated. (B) Mutant Vps4 proteins 
dominantly inhibit MLV replication. Samples were from (A) and MLV vector titers were quantitated as 
described in Experimental Methods. 
5 14. Figure 6. TsglOl Depletion and Vps4 Mutants Block Virus Budding at a Late Stage. (A) 

Representative thin section electron micrographs showing the effects on HIV-1 budding of mutating the 
p6 PTAP sequence (left), depleting cellular TsglOl (center), or overexpressing the dominant negative 
Vps4 E22 8Q protein (right). In each case, virus budding arrested at a late stage, with immature particles 
remaining connected to the plasma membrane via membrane stalks or to other budding particles to form 
1 0 "clusters' 5 of interconnected particles, A subset of the cells transfected with the siRNA and mutant Vps4 
constructs exhibited what appeared to be Class E compartments in the plane of section, and a full 
description will be published elsewhere. Scale bars are all 100 nm. (B) Representative thin section 
electron micrographs showing the effects on MLV budding of mutating the pl2 PPPY motif (left) or 
overexpressing the dominant negative Vps4 E22 8Q protein (right). These MLV late domain phenotypes are 
15 similar to those described above for HIV-1 . 

15. Figure 7 shows that Vps4b mutants block budding of HIV GAG. Figure 7 A shows a Western 
blot of proteins isolated from 293T cells co-transfected with the indicated constructs. Figure 7B shows 
immunofluorescence of the GsRed and GFP labeled proteins. 

16. Figure 8 shows that Vps4b blocks virus budding and infectivity. Figure 8 A shows a Western 
20 blot of proteins isolated from cells co-transfected with HIV virus and the indicated constructs. Figure SB 

shows the results of a MAGIC assay of the co-transfected cells plotting the various constructs vs the 
infectivity titer. 

17. Figure 9 shows that BC2 overexpression blocks HIV budding. Figure 9B shows a Western 
blot of proteins isolated from cells after a GST pulldown experiment of cells cotransfected with the 

25 indicated constructs. Figure 9C shows a Western blot of proteins isolated from cells co-transfected with 
the indicated constructs and HIV virus. Figure 9D shows the results of a MAGIC infectivity assay on 
cells co-transfected with the indicated constructs. 

18. Figure 10 shows a schematic of molecules involved in viral budding. 

V. DETAILED DESCRIPTION 
30 19. The present invention may be understood more readily by reference to the following detailed 

description of preferred embodiments of the invention and the Examples included therein and to the 
Figures and their previous and following description. 

20. Before the present compounds, compositions, articles, devices, and/or methods are disclosed 
and described, it is to be understood that this invention is not limited to specific synthetic methods, 
35 specific recombinant biotechnology methods unless otherwise specified, or to particular reagents unless 
otherwise specified, as such may, of course, vary. It is also to be understood that the terminology used 
herein is for the purpose of describing particular embodiments only and is not intended to be limiting. 

— A — 
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A. Definitions 

21 . As used in the specification and the appended claims, the singular forms "a," "an" and "the" 
include plural referents unless the context clearly dictates otherwise. Thus, for example, reference to "a 
pharmaceutical carrier 11 includes mixtures of two or more such carriers, and the like. 
5 22. Ranges may be expressed herein as from "about" one particular value, and/or to "about" 

another particular value. When such a range is expressed, another embodiment includes from the one 
particular value and/or to the other particular value. Similarly, when values are expressed as 
approximations, by use of the antecedent "about," it will be understood that the particular value forms 
another embodiment. It will be further understood that the endpoints of each of the ranges are significant 
. 10 both in relation to the other endpoint, and independently of the other endpoint. It is also understood that 
there are a number of values disclosed herein, and that each value is also herein disclosed as "about" that 
particular value in addition to the value itself For example, if the value "10" is disclosed, then "about 
10" is also disclosed. It is also understood that when a value is disclosed that "less than or equal to" the 
value, "greater than or equal to the value" and possible ranges between values are also disclosed, as 
15 appropriately understood by the skilled artisan. For example, if the value "10" is disclosed then "less 
than or equal to 10"as well as "greater than or equal to 10" is also disclosed. 

23. Throughout this application various assays and methods are disclosed for determining 
inhibition or inhibitors. It is understood that these methods are also disclosed as being practiced, for 
example, by modulating or modulators, or enhancing or enhancers (molecules that increase a certain 

20 activity). In other words, the particular activity assayed for, inhibition, modulation, or enhancing can all 
be equally used. 

24. In this specification and in the claims which follow, reference will be made to a number of 
terms which shall be defined to have the following meanings: 

25. "Optional" or "optionally" means that the subsequently described event or circumstance may 
25 or may not occur, and that the description includes instances where said event or circumstance occurs and 

instances where it does not. 

26. Fragments. It is understood that where peptides or nucleic acid molecules are described, 
fragments of these molecules are also disclosed, unless specifically indicated otherwise. For example, if 
a Bc2 is disclosed in a particular method or assay, it is understood that fragments of Bc2 are also 

30 disclosed therein, unless specifically indicated to the contrary. 

27. Throughout this application, various publications are referenced. The disclosures of these 
publications in their entireties are hereby incorporated by reference into this application in order to more 
fully describe the state of the art to which this invention pertains. The references disclosed are also 
individually and specifically incoiporated by reference herein for the material contained in them that is 

35 discussed in the sentence in which the reference is relied upon. 

28. It will be apparent to those skilled in the art that various modifications and variations can be 
made in the present invention without departing from the scope or spirit of the invention. Other 
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embodiments of the invention will be apparent to those skilled in the art from consideration of the 
specification and practice of the invention disclosed herein. It is intended that the specification and 
examples be considered as exemplary only, with a true scope and spirit of the invention being indicated 
by the following claims. 

5 29. Disclosed are the components to be used to prepare the disclosed compositions as well as the 

compositions themselves to be used within the methods disclosed herein. These and other materials are 
disclosed herein, and it is understood that when combinations, subsets, interactions, groups, etc. of these 
materials are disclosed that while specific reference of each various individual and collective 
combinations and permutation of these compounds may not be explicitly disclosed, each is specifically 

10 contemplated and described herein. For example, if a particular TSG 101 variant is disclosed and 

discussed and a number of modifications that can be made to a number of molecules including the TSG 
101 variant are discussed, specifically contemplated is each and every combination and permutation of 
the TSG 101 variant and the modifications that are possible unless specifically indicated to the contrary. 
Thus, if a class of molecules A, B, and C are disclosed as well as a class of molecules D, E, and F and an 

15 example of a combination molecule, A-D is disclosed, then even if each is not individually recited each is 
individually and collectively contemplated meaning combinations, A-E, A-F, B-D, B-E, B-F, C-D, C-E, 
and C-F are considered disclosed. Likewise, any subset or combination of these is also disclosed. Thus, 
for example, the sub-group of A-E, B-F, and C-E would be considered disclosed. This concept applies to 
all aspects of this application including, but not limited to, steps in methods of making and using the 

20 disclosed compositions. Thus, if there are a variety of additional steps that can be performed it is 
understood that each of these additional steps can be performed with any specific embodiment or 
combination of embodiments of the disclosed methods. 
B. Compositions and Methods 

30. Disclosed herein are mechanisms and pathways supported by data which are consistent with 
25 an ESCRT-1 complex selecting HIV Gag and other viral and non-viral PTAP proteins as cargo. TSG101 

binds HIV Gag-p6 and VPS28 during this selection. The VPS28 binding to the TSG101 S-box is tight 
(8nM) and long-lived (tl/2>lh), and VPS28 is unstable when not bound to the TSG101 S-box. It is also 
disclosed herein that VPS28 (and/or the S-box) are required for HIV budding. The data disclosed herein 
indicate that other Vps proteins are involved downstream of these interactions. ATPase-defective 
30 mutants of VPS4a and 4b disrupt virus budding. Furthermore, disclosed herein, BC2/CHMP2 binds to 
VPS4a/b and BC2/CHMP2 overexpression (or fusion to DsRed) inhibits HIV budding. Disclosed are 
compositions and methods for addressing the effect of Bc2 on viral budding. 

31. TsglOl binds to other viral and non-viral PTAP sequences. It also binds to human Vps28, 
tightly and with a long half life, and this complex is required for ESCRT-1 function in HIV budding. 

35 32. Disclosed are interactions between TSG101 and any other viral genes with a PTAP sequence, 

such as Ebola Vp40, Blue tongue virus, and PPPY viruses. 

— 6 — 
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33. The binding sites of TSG101, Vps4A and Vps4B, Bc2, Vps28, Vps37, and GAG molecules, 
as disclosed herein set forth targets for interaction with ligands which will be able to bind or modulate, 
such as inhibit, the disclosed interactions. 

34. A compound that is identified or designed as a result of any of the disclosed methods can be 
5 obtained (or synthesized) and tested for its biological activity, e.g., inhibition of TSG101-GAG 

interaction activity. 

1. TSGIOI-Ub interactions and effects 

35. Disclosed are methods for reducing interactions between TSGlOland Ub, comprising 
incubating an inhibitor of the interaction between TSG101 and Ub. Also disclosed are methods for 

10 inhibiting retroviral budding comprising administering an inhibitor of the interaction between TSG101 
and Ub. 

36. Disclosed are methods of treating a subject comprising administering to the subject an 
inhibitor of HIV budding, wherein the inhibitor reduces the interaction between TSG101 and Ub, and 
wherein the subject is in need of such treatment. 

15 37. Also disclosed are methods, wherein the inhibitor disrupts an interaction between TSG101 

and the Ub, and/or wherein the inhibitor interacts with the UEV domain of TSG10L 

38. Disclosed are methods of identifying an inhibitor of an interaction between TSG101 and Ub 
comprising incubating a library of molecules with TSG101 forming a mixture, and identifying the 
molecules that disrupt the interaction between TSG101 and Ub, wherein the interaction disrupted 

20 comprises an interaction between the Ub and an amino acid of TSG101 . 

39. Also disclosed are methods, wherein the step of isolating comprises incubating the mixture 
with a molecule comprising Ub. 

40. Disclosed are methods of identifying an inhibitor of an interaction between TSG101 and Ub 
comprising incubating a library of molecules with Ub forming a mixture, and identifying the molecules 

25 that disrupt the interaction between Ub and TSG101, wherein the interaction disrupted comprises an 
interaction between the Ub and an amino acid of TSG101. 

41. Also disclosed are the methods, wherein the step of isolating comprises incubating the 
mixture with a molecule comprising a TSG 101 UEV domain. 

42. Also disclosed are compositions produced by any of the processes as disclosed herein, as 
30 well as compositions capable of being identified by the processes disclosed herein. 

43. It is understood that the disclosed methods and compositions are active for retroviruses 
dependent on GAG, such as HIV and Ebola. 

44. It is also understood that the disclosed methods and compositions can be further combined, 
with for example, PTAP domain addition, or any other protein or composition disclosed herein. 

35 Disclosed herein, Ub and PTAP domains bind TSG101 such that the binding of the two in the presence of 
each other is tight then the binding of either alone. Thus, disclosed are methods and assays where, for 
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example, both PTAP domains and Ub are incubated with target molecules or used in systems to assays 
for molecules that inhibit retroviral budding. 

45. Disclosed are methods of manufacturing a composition for inhibiting the interaction between 
TSG101 and Ub comprising synthesizing the inhibitors as disclosed herein. 
5 46. Also disclosed are methods that include mixing a pharmaceutical carrier with the inhibitor as 

disclosed herein, and produced by any of the disclosed methods. 

47. Disclosed are methods of identifying inhibitors of TSGIOI-Ub interaction comprising, a) 
administering a composition to a system, wherein the system supports TSGIOI-Ub interaction, b) 
assaying the effect of the composition on the amount of TSGIOI-Ub is in the system, and c) selecting a 

10 composition which causes a decrease in the amount of TSGIOI-Ub present in the system relative to the 
system without the addition of the composition. 

48. Also disclosed are methods of identifying inhibitors of HIV budding comprising, a) 
administering a composition to a system, wherein the system supports HIV budding via a TSGIOI-Ub 
interaction, b) assaying the effect of the composition on the amount of HIV budding in the system, and c) 

15 selecting a composition which causes a decrease in the amount of HIV budding present in the system 
because of an inhibition of the TSGIOI-Ub interaction relative to the system without the addition of the 
composition. 

49. Disclosed are methods of inhibiting HIV budding comprising administering a composition, 
wherein the composition prevents HIV budding, wherein the composition is defined as a composition 

20 capable of being identified by administering the composition to a system, wherein the system supports 

HIV-budding via a TSGIOI-Ub interaction, assaying the effect of the composition on the amount of HIV 
budding in the system, and selecting a composition which causes a decrease in the amount of HIV 
budding present in the system because of an inhibition of the TSGIOI-Ub interaction relative to the 
system without the addition of the composition. 

25 50. Also disclosed are methods of inhibiting HIV budding comprising administering a 

composition that reduces an interaction between TSG101 and Ub, 

5 1 . Disclosed are methods of making a composition capable of inhibiting HIV budding 
comprising admixing a compound with a pharmaceutically acceptable carrier, wherein the compound is 
identified by administering the compound to a system, wherein the system supports HIV budding via a 

30 TSG101 -Ub interaction, assaying the effect of the compound on the amount of HIV budding in the 

system, and selecting a compound which causes a decrease in the amount of HIV budding in the system 
because of an inhibition of the TSGIOI-Ub interaction, relative to the system without the addition of the 
compound. 

52. Disclosed are methods of manufacturing an inhibitor to HIV budding comprising, a) 

35 administering a composition to a system, wherein the system supports HIV budding via a TSGIOI-Ub 
interaction, b) assaying the effect of the composition on the amount of HIV budding in the system, c) 
selecting a composition which causes a decrease in the amount of HIV budding present in the system 
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because of an inhibition of the TSGIOI-Ub interaction, relative to the system with the addition of the 
composition, and d) synthesizing the composition. 

53. Also disclosed are methods comprising the step of admixing the composition with a 

pharmaceutical carrier. 

5 54. Disclosed are methods of identifying an inhibitor of an interaction between TSGlOl and Ub 

comprising a) administering a composition to a system, wherein the system comprises TSGlOl, b) 
assaying the effect of the composition on a TSGIOI-Ub interaction, and c) selecting a composition which 
inhibits a TSGIOI-Ub interaction. 

55. Disclosed are cells that further comprising an inhibitor of a TSGIOI-Ub interaction. 
10 2. Vps4a and Vps4b ATPase activity and effects 

56. Disclosed are methods of identifying inhibitors of Vps4A or 4B ATPase activity comprising, 

a) administering a composition to a system, wherein the system supports Vps4A or 4B ATPase activity, 

b) assaying the effect of the composition on the amount of Vps4A or 4B ATPase activity in the system, 
and c) selecting a composition which causes a decrease in the amount of Vps4A or 4B ATPase activity 
present in the system relative to the system without the addition of the composition. Further disclosed 
are systems wherein the system comprises retroviral budding activity and the Vps4A or 4B ATPase 
activity is determined by assaying the retroviral budding activity. 

57! Also disclosed are methods of identifying inhibitors of HIV budding comprising, a) 
administering a composition to a system, wherein the system supports HIV budding via a Vps4A or 4B 
ATPase activity, b) assaying the effect of the composition on the amount of HIV budding in the system, 
and c) selecting a composition which causes a decrease in the amount of HIV budding present in the 
system because of an inhibition of the Vps4A or 4B ATPase activity relative to the system without the 

addition of the composition. 

58. Disclosed are methods of inhibiting HIV budding comprising administering a composition, 
wherein the composition prevents HIV budding, wherein the composition is defined as a composition 
capable of being identified by administering the composition to a system, wherein the system supports 
HIV-budding via a Vps4A or 4B ATPase activity interaction, assaying the effect of the composition on 
the amount of HIV budding in the system, and selecting a composition which causes a decrease in the 
amount of HIV budding present in the system because of an inhibition of the Vps4A or 4B ATPase 

30 activity relative to the system without the addition of the composition. 

59. Also disclosed are methods of inhibiting HIV budding comprising administering a 
composition that reduces Vps4A or 4B ATPase activity. 

60. Disclosed are methods of making a composition capable of inhibiting HIV budding 
comprising admixing a compound with a pharmaceutical^ acceptable carrier, wherein the compound is 
identified by administering the compound to a system, wherein the system supports HIV budding via a 
Vps4A or 4B ATPase activity, assaying the effect of the compound on the amount of HIV budding in the 
system, and selecting a compound which causes a decrease in the amount of HIV budding in the system 
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because of an inhibition of the Vps4A or 4B ATPase activity, relative to the system without the addition 
of the compound. 

61. Disclosed are methods of manufacturing an inhibitor to HIV budding comprising, a) 
administering a composition to a system, wherein the system supports HIV budding via a Vps4A or 4B 

5 ATPase activity, b) assaying the effect of the composition on the amount of HIV budding in the system, 
c) selecting a composition which causes a decrease in the amount of HIV budding present in the system 
because of an inhibition of the Vps4A or 4B ATPase activity, relative to the system with the addition of 
the composition, and d) synthesizing the composition. 

62. Also disclosed are methods comprising the step of admixing the composition with a 
10 pharmaceutical carrier. 

63. Disclosed are methods of identifying an inhibitor of Vps4A or 4B ATPase activity 
comprising a) administering a composition to a system, wherein the system comprises Vps4A or 4B 
ATPase activity, b) assaying the effect of the composition on a Vps4A or 4B ATPase activity, and c) 
selecting a composition which inhibits a Vps4A or 4B ATPase activity. 

1 5 64. Disclosed are cells comprising, a) a regulatable nucleic acid comprising sequence encoding a 

Vps4A or 4B gene and b) a nucleic acid comprising sequence encoding a budding retrovirus as well as 
cells further comprising a regulatable nucleic acid comprising sequence encoding a TSG101 gene. 

65. Disclosed are cells that further comprising an inhibitor of a Vps4A or 4B ATPase activity. 
3. Vps4a-Vps4b interactions and effects 
20 66. Disclosed are methods for reducing interactions between Vps4a and Vps4b, comprising 

incubating an inhibitor of the interaction between Vps4a and Vps4b. Also disclosed are methods for 
inhibiting retroviral budding comprising administering an inhibitor of the interaction between Vps4a and 
Vps4b. 

67. Disclosed are methods of treating a subject comprising administering to the subject an 

25 inhibitor of HIV budding, wherein the inhibitor reduces the interaction between Vps4a and Vps4b, and 
wherein the subject is in need of such treatment. 

68. Also disclosed are methods, wherein the inhibitor disrupts an interaction between Vps4a and 
theVps4b. 

69. Disclosed are methods of identifying an inhibitor of an interaction between Vps4a and Vps4b 
30 comprising incubating a library of molecules with Vps4a forming a mixture, and identifying the 

molecules that disrupt the interaction between Vps4a and Vps4b, wherein the interaction disrupted 
comprises an interaction between the Vps4b and an amino acid of Vps4a. 

70. Also disclosed are methods, wherein the step of isolating comprises incubating the mixture 
with a molecule comprising Vps4a and/or Vps4b. 

35 71. Disclosed are methods of identifying an inhibitor of an interaction between Vps4a and Vps4b 

comprising incubating a library of molecules with Vps4b forming a mixture, and identifying the 
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molecules that disrupt the interaction between Vps4a and Vps4b, wherein the interaction disrupted 
comprises an interaction between the Vps4a and an amino acid of Vps4b. 

72. Also disclosed are compositions produced by any of the processes as disclosed herein, as 
well as compositions capable of being identified by the processes disclosed herein. 
5 73. It is understood that the disclosed methods and compositions are active for retroviruses 

dependent on GAG, such as HIV and Ebola. 

74. It is also understood that the disclosed methods and compositions can be further combined, 
with for example, any other protein or composition disclosed herein, such as the PTAP domain. Thus, 
disclosed are methods and assays where, for example, Vps4a and Vps4b and Ub are incubated with target 

10 molecules or used in systems to assays for molecules that inhibit retroviral budding. 

75. Disclosed are methods of manufacturing a composition for inhibiting the interaction between 
Vps4a and Vps4b comprising synthesizing the inhibitors as disclosed herein. 

76. Also disclosed are methods that include mixing a pharmaceutical carrier with the inhibitor as 
disclosed herein, and produced by any of the disclosed methods. 

15 11 * Disclosed are methods of identifying inhibitors of Vps4a-Vps4b interaction comprising, a) 

administering a composition to a system, wherein the system supports Vps4a-Vps4b interaction, b) 
assaying the effect of the composition on the amount of Vps4a-Vps4b is in the system, and c) selecting a 
composition which causes a decrease in the amount of Vps4a-Vps4b present in the system relative to the 
system without the addition of the composition. 

20 78. Also disclosed are methods of identifying inhibitors of HIV budding comprising, a) 

administering a composition to a system, wherein the system supports HIV budding via a Vps4a~Vps4b 
interaction, b) assaying the effect of the composition on the amount of HIV budding in the system, and c) 
selecting a composition which causes a decrease in the amount of HIV budding present in the system 
because of an inhibition of the Vps4a-Vps4b interaction relative to the system without the addition of the 

25 composition. 

79. Disclosed are methods of inhibiting HIV budding comprising administering a composition, 
wherein the composition prevents HIV budding, wherein the composition is defined as a composition 
capable of being identified by administering the composition to a system, wherein the system supports 
HIV-budding via a Vps4a-Vps4b interaction, assaying the effect of the composition on the amount of 

30 HIV budding in the system, and selecting a composition which causes a decrease in the amount of HIV 
budding present in the system because of an inhibition of the Vps4a^Vps4b interaction relative to the 
system without the addition of the composition. 

80. Also disclosed are methods of inhibiting HIV budding comprising administering a 
composition that reduces an interaction between Vps4a and Vps4b. 

35 81 . Disclosed are methods of making a composition capable of inhibiting HIV budding 

comprising admixing a compound with a pharmaceutically acceptable carrier, wherein the compound is 
identified by administering the compound to a system, wherein the system supports HIV budding via a 
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Vps4a-Vps4b interaction, assaying the effect of the compound on the amount of HIV budding in the 
system, and selecting a compound which causes a decrease in the amount of HIV budding in the system 
because of an inhibition of the Vps4a-Vps4b interaction, relative to the system without the addition of the 
compound. 

5 82. Disclosed are methods of manufacturing an inhibitor to HIV budding comprising, a) 

administering a composition to a system, wherein the system supports HIV budding via a Vps4a-Vps4b 
interaction, b) assaying the effect of the composition on the amount of HIV budding in the system, c) 
selecting a composition which causes a decrease in the amount of HIV budding present in the system 
because of an inhibition of the Vps4a-Vps4b interaction, relative to the system with the addition of the 
1 0 composition, and d) synthesizing the composition. 

83. Also disclosed are methods comprising the step of admixing the composition with a 
pharmaceutical carrier. 

84. Disclosed are methods of identifying an inhibitor of an interaction between Vps4a and Vps4b 
comprising a) administering a composition to a system, wherein the system comprises Vps4a, b) assaying 

15 the effect of the composition on a Vps4a-Vps4b interaction, and c) selecting a composition which inhibits 
a Vps4a-Vps4b interaction. 

85. Disclosed are methods of identifying an inhibitor of an interaction between Vps4a and Vps4b 
comprising a) administering a composition to a system, wherein the system comprises Vps4b, b) assaying 
the effect of the composition on a Vps4a-Vps4b interaction, and c) selecting a composition which inhibits 

20 a Vps4a-Vps4b interaction. 

86. Disclosed are cells that further comprising an inhibitor of a Vps4a- Vps4b interaction. 

4. Vps4a-Bc2 interactions and effects 

87. Disclosed are methods for reducing interactions between Vps4a and Bc2, comprising 
incubating an inhibitor of the interaction between Vps4a and Bc2. Also disclosed are methods for 

25 inhibiting retroviral budding comprising administering an inhibitor of the interaction between Vps4a and 
Bc2 or administration of Bc2, such as excess Bc2. 

88. Disclosed are methods of treating a subject comprising administering to the subject an 
inhibitor of HIV budding, wherein the inhibitor reduces the interaction between Vps4a and Bc2, and 
wherein the subject is in need of such treatment, and \yherein the inhibitor is Bc2 or analog. 

30 89. Also disclosed are methods, wherein the inhibitor disrupt's an interaction between Vps4a and 

the Bc2. 

90. Disclosed are methods of identifying an inhibitor of an interaction between Vps4a and Bc2 
comprising incubating a library of molecules with Vps4a forming a mixture, and identifying the 
molecules that disrupt the interaction between Vps4a and Bc2, wherein the interaction disrupted 

35 comprises an interaction between the Bc2 and an amino acid of Vps4a. 

91. Also disclosed are methods, wherein the step of isolating comprises incubating the mixture 
with a molecule comprising Bc2 or fragment thereof. 
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92. Disclosed are methods of identifying an inhibitor of an interaction between Vps4a and Bc2 
comprising incubating a library of molecules with Bc2 forming a mixture, and identifying the molecules 
that disrupt the interaction between Vps4a and Bc2, wherein the interaction disrupted comprises an 
interaction between the Bc2 and an amino acid of Vps4a. 
5 93. Also disclosed are methods, wherein the step of isolating comprises incubating the mixture 

with a molecule comprising Vps4a or fragment thereof 

94. Also disclosed are compositions produced by any of the processes as disclosed herein, as 
well as compositions capable of being identified by the processes disclosed herein. 

95. It is understood that the disclosed methods and compositions are active for retroviruses 
10 dependent on GAG, such as HIV and Ebola. 

96. Disclosed are methods of manufacturing a composition for inhibiting the interaction between 
Vps4a and Bc2 comprising synthesizing the inhibitors as disclosed herein. 

97. Also disclosed are methods that include mixing a pharmaceutical carrier with the inhibitor as 
disclosed herein, and produced by any of the disclosed methods. 

15 98. Disclosed are methods of identifying inhibitors of Vps4a and Bc2 interaction comprising, a) 

administering a composition to a system, wherein the system supports Vps4a-Bc2 interaction, b) assayiag 
the effect of the composition on the amount of Vps4a~Bc2 is in the system, and c) selecting a 
composition which causes a decrease in the amount of Vps4a-Bc2 present in the system relative to the 
system without the addition of the composition. 

20 99. Also disclosed are methods of identifying inhibitors of HIV budding comprising, a) 

administering a composition to a system, wherein the system supports HIV budding via a Vps4a-Bc2 
interaction, b) assaying the effect of the composition on the amount of HIV budding in the system, and c) 
selecting a composition which causes a decrease in the amount of HIV budding present in the system 
because of an inhibition of the Vps4a-Bc2 interaction relative to the system without the addition of the 

25 composition. 

100, Disclosed are methods of inhibiting HIV budding comprising administering a composition, 
wherein the composition prevents HIV budding, wherein the composition is defined as a composition 
capable of being identified by administering the composition to a system, wherein the system supports 
HIV-budding via a Vps4a-Bc2 interaction, assaying the effect of the composition on the amount of HIV 

30 budding in the system, and selecting a composition which causes a decrease in the amount of HIV 
budding present in the system because of an inhibition of the Vps4a-Bc2 interaction relative to the 
system without the addition of the composition. 

101. Also disclosed are methods of inhibiting HIV budding comprising administering a 
composition that reduces an interaction between Vps4a-Bc2. 

35 102, Disclosed are methods of making a composition capable of inhibiting HIV budding 

comprising admixing a compound with a pharmaceutical^ acceptable carrier, wherein the compound is 
identified by administering the compound to a system, wherein the system supports HIV budding via a 
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Vps4a-Bc2interaction, assaying the effect of the compound on the amount of HIV budding in the system, 
and selecting a compound which causes a decrease in the amount of HIV budding in the system because 
of an inhibition of the Vps4a-Bc2interaction, relative to the system without the addition of the compound. 
103. Disclosed are methods of manufacturing an inhibitor to HIV budding comprising, a) 
5 administering a composition to a system, wherein the system supports HIV budding via a Vps4a-Bc2 
interaction, b) assaying the effect of the composition on the amount of HIV budding in the system, c) 
selecting a composition which causes a decrease in the amount of HIV budding present in the system 
because of an inhibition of the Vps4a-Bc2 interaction, relative to the system with the addition of the 
composition, and d) synthesizing the composition. 
10 104. Also disclosed are methods comprising the step of admixing the composition with a 

pharmaceutical carrier. 

105. Disclosed are methods of identifying an inhibitor of an interaction between Vps4a and Bc2 
comprising a) administering a composition to a system, wherein the system comprises Vps4a, b) assaying 
the effect of the composition on a Vps4a-Bc2 interaction, and c) selecting a composition which inhibits a 

1 5 Vps4a-Bc2 interaction . 

106. Disclosed are cells that further comprising an inhibitor of a Vps4a-Bc2 interaction or an 
inhibitor of Vps4a-Bc2 induced viral budding. 

5- Vps4b-Bc2 interactions and effects 

107. Disclosed are methods for reducing interactions between Vps4b and Bc2, comprising 
20 incubating an inhibitor of the interaction between Vps4b and Bc2. Also disclosed are methods for 

inhibiting retroviral budding comprising administering an inhibitor of the interaction between Vps4b and 
Bc2 or administration of Bc2, such as excess Bc2. 

108. Disclosed are methods of treating a subject comprising administering to the subject an 
inhibitor of HIV budding, wherein the inhibitor reduces the interaction between Vps4b and Bc2, and 

25 wherein the subject is in need of such treatment, and wherein the inhibitor is Bc2 or analog. 

109. Also disclosed are methods, wherein the inhibitor disrupts an interaction between Vps4b 
and the Bc2. 

1 1 0. Disclosed are methods of identifying an inhibitor of an interaction between Vps4b and Bc2 
comprising incubating a library of molecules with Vps4b forming a mixture, and identifying the 

30 molecules that disrupt the interaction between Vps4b and Bc2, wherein the interaction disrupted 
comprises an interaction between the Bc2 and an amino acid of Vps4b. 

111. Also disclosed are methods, wherein the step of isolating comprises incubating the mixture 
with a molecule comprising Bc2 or fragment thereof. 

112. Disclosed are methods of identifying an inhibitor of an interaction between Vps4b and Bc2 
35 comprising incubating a library of molecules with Bc2 forming a mixture, and identifying the molecules 

that disrupt the interaction between Vps4b and Bc2, wherein the interaction disrupted comprises an 
interaction between the Bc2 and an amino acid of Vps4b. 
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113. Also disclosed are methods, wherein the step of isolating comprises incubating the mixture 
with a molecule comprising Vps4b or fragment thereof. 

114. Also disclosed are compositions produced by any of the processes as disclosed herein, as 
well as compositions capable of being identified by the processes disclosed herein. 

1 1 5. It is understood that the disclosed methods and compositions are active for retroviruses 
dependent on GAG, such as HIV and Ebola. 

116. Disclosed are methods of manufacturing a composition for inhibiting the interaction 
between Vps4b and Bc2 comprising synthesizing the inhibitors as disclosed herein. 

117. Also disclosed are methods that include mixing a pharmaceutical carrier with the inhibitor 
as disclosed herein, and produced by any of the disclosed methods. 

118. Disclosed are methods of identifying inhibitors of Vps4b and Bc2 interaction comprising, a) 
administering a composition to a system, wherein the system supports Vps4b-Bc2 interaction, b) assaying 
the effect of the composition on the amount of Vps4b-Bc2 is in the system, and c) selecting a 
composition which causes a decrease in the amount of Vps4b-Bc2 present in the system relative to the 
system without the addition of the composition. 

1 19. Also disclosed are methods of identifying inhibitors of HIV budding comprising, a) 
administering a composition to a system, wherein the system supports HIV budding via a Vps4b-Bc2 
interaction, b) assaying the effect of the composition on the amount of HIV budding in the system, and c) 
selecting a composition which causes a decrease in the amount of HIV budding present in the system 
because of an inhibition of the Vps4b-Bc2 interaction relative to the system without the addition of the 
composition. 

120. Disclosed are methods of inhibiting HIV budding comprising administering a composition, 
wherein the composition prevents HIV budding, wherein the composition is defined as a composition 
capable of being identified by administering the composition to a system, wherein the system supports 
HIV-budding via a Vps4b-Bc2 interaction, assaying the effect of the composition on the amount of HIV 
budding in the system, and selecting a composition which causes a decrease in the amount of HIV 
budding present in the system because of an inhibition of the Vps4b-Bc2 interaction relative to the 
system without the addition of the composition. 

121. Also disclosed are methods of inhibiting HIV budding comprising administering a 
composition that reduces an interaction between Vps4b-Bc2. 

122. Disclosed are methods of making a composition capable of inhibiting HIV budding 
comprising admixing a compound with a pharmaceutical ly acceptable carrier, wherein the compound is 
identified by administering the compound to a system, wherein the system supports HIV budding via a 
Vps4b-Bc2interaction, assaying the effect of the compound on the amount of HIV budding in the system, 
and selecting a compound which causes a decrease in the amount of HIV budding in the system because 
of an inhibition of the Vps4b-Bc2interaction, relative to the system without the addition of the 
compound. 
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1 23. Disclosed are methods of manufacturing an inhibitor to HIV budding comprising, a) 
administering a composition to a system, wherein the system supports HIV budding via a Vps4b-Bc2 
interaction, b) assaying the effect of the composition on the amount of HIV budding in the system, c) 
selecting a composition which causes a decrease in the amount of HIV budding present in the system 

5 because of an inhibition of the Vps4b-Bc2 interaction, relative to the system with the addition of the 
composition, and d) synthesizing the composition. 

124. Also disclosed are methods comprising the step of admixing the composition with a 
pharmaceutical carrier. 

125. Disclosed are methods of identifying an inhibitor of an interaction between Vps4b and Bc2 
10 comprising a) administering a composition to a system, wherein the system comprises Vps4b, b) assaying 

the effect of the composition on a Vps4b-Bc2 interaction, and c) selecting a composition which inhibits a 
Vps4b-Bc2 interaction. 

1 26. Disclosed are cells that further comprising an inhibitor of a Vps4b-Bc2 interaction or an 
inhibitor of Vps4b-Bc2 induced viral budding. 

15 6. TSG101-GAG interactions and effects 

127. Disclosed are methods for reducing interactions between TSG101 and HIV GAG, 
comprising incubating an inhibitor of the interaction between TSG101 and GAG with TSG101, GAG, or 
TSG101 and HIV GAG. 

128. Also disclosed are methods for inhibiting HIV budding comprising administering an 
20 inhibitor of the interaction between TSG 1 0 1 and HIV GAG. 

129. Disclosed are methods of treating a subject comprising administering to the subject an 
inhibitor of HIV budding. 

130. Disclosed are methods wherein the inhibitor prevents expression of TSG101 mRNA, 
wherein the inhibitor interacts with the mRNA of TSG10, wherein the inhibitor is an interfering RNA. 

25 131. Disclosed are methods for inhibiting viral budding in a virus comprising proteins that 

interact with TSG101, comprising administering an inhibitor of the Vpsl or Vps 4 A or 4B pathway in a 
cell. 

132. Disclosed are methods, wherein the virus is HIV or Ebola. 

133. Disclosed are methods, wherein the inhibitor prevents normal trafficking of TSG101, 
30 wherein the inhibitor reduces the release of TSG101 from endosomes, wherein the inhibitor prevents 

normal Vps4A or 4B function, wherein the inhibitor interacts with Vps4A or 4B. 

134. Disclosed are methods for reducing interactions between TSG101 and HIV GAG, 
comprising incubating an inhibitor of the interaction between TSG101 and GAG. Also disclosed are 
methods for inhibiting HIV budding comprising administering an inhibitor of the interaction between 

35 TSG 101 and HIV GAG. 1 
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135. Disclosed are methods of treating a subject comprising administering to the subject an 

inhibitor of HIV budding, wherein the inhibitor reduces the interaction between TSG101 and HIV GAG, 
and wherein the subject is in need of such treatment. 

136. Also disclosed are methods, wherein the HIV-GAG comprises a P6 region, wherein the 

5 inhibitor disrupts an interaction between TSG101 and the P6 region of HIV GAG, wherein the HIV-GAG 
comprises a PTAP domain, wherein the inhibitor disrupts an interaction between TSG101 and the PTAP 
domain, and/or wherein the inhibitor interacts with the UEV domain of TSG101. 

137. Disclosed are methods of identifying an inhibitor of an interaction between TSG101 and 
retroviral GAG comprising incubating a library of molecules with TSG101 forming a mixture, and 

10 identifying the molecules that disrupt the interaction between TSG101 and retroviral GAG, wherein the 
interaction disrupted comprises an interaction between the retroviral GAG and an amino acid of TSG10L 

138. Disclosed are methods of identifying an inhibitor of Vsp4 ATPase activity comprising 
incubating a library of molecules with ATPase forming a mixture, and identifying the molecules that 
inhibit the ATPase activity. Also disclosed are methods wherein the ATPase activity is determined 

15 through assaying viral budding, such as HIV viral budding, as disclosed herein. 

139. Also disclosed are methods, wherein the step of isolating comprises incubating the mixture 
with a molecule comprising a PTAP domain. 

140. Disclosed are methods of identifying an inhibitor of an interaction between TSG101 and 
retroviral GAG comprising incubating a library of molecules with retroviral GAG forming a mixture, and 

20 identifying the molecules that disrupt the interaction between retroviral GAG and TSG101, wherein the 
interaction disrupted comprises an interaction between the retroviral GAG and an amino acid of TSG101. 

141. Also disclosed are the methods, wherein the step of isolating comprises incubating the 
mixture with a molecule comprising a TSG 101 UEV domain. 

142. Also disclosed are compositions produced by any of the processes as disclosed herein, as 
25 well as compositions capable of being identified by the processes disclosed herein. 

143. It is understood that the disclosed methods and compositions are active for retroviruses 
dependent on GAG, such as HIV and Ebola. 

144. It is also understood that the disclosed methods and compositions can be further combined, 
with for example, Ubiquitin addition. Disclosed herein, Ub and PTAP domains bind TSG101 such that 

30 the binding of the two in the presence of each other is tight then the binding of either alone. Thus, 

disclosed are methods and assays where, for example, both PTAP domains and Ub are incubated with 
target molecules or used in systems to assays for molecules that inhibit retroviral budding. Also 
disclosed are tripartite systems, wherein Ub, TSG101 and GAG or fragments thereof are present or 
utilized in the disclosed method in any of the possible permutations. 

35 145. Disclosed are methods of manufacturing a composition for inhibiting the interaction 

between TSG101 and GAG comprising synthesizing the inhibitors as disclosed herein. 
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146. Also disclosed are methods that include mixing a pharmaceutical carrier with the inhibitor 
as disclosed herein, and produced by any of the disclosed methods. 

147. Disclosed are methods of identifying inhibitors of TSG101-GAG interaction comprising, a) 
administering a composition to a system, wherein the system supports TSG101-GAG interaction, b) 

5 assaying the effect of the composition on the amount of TSG101-GAG is in the system, and c) selecting a 
composition which causes a decrease in the amount of TSG101 -GAG present in the system relative to the 
system without the addition of the composition. 

148. Also disclosed are methods of identifying inhibitors of HIV budding comprising, a) 
administering a composition to a system, wherein the system supports HIV budding via a TSG101-GAG 

10 interaction, b) assaying the effect of the composition on the amount of HIV budding in the system, and c) 
selecting a composition which causes a decrease in the amount of HIV budding present in the system 
because of an inhibition of the TSG101-GAG interaction relative to the system without the addition of 
the composition. 

149. Disclosed are methods of inhibiting HIV budding comprising administering a composition, 
1 5 wherein the composition prevents HIV budding, wherein the composition is defined as a composition 

capable of being identified by administering the composition to a system, wherein the system supports 
HIV-budding via a TSG101-GAG interaction, assaying the effect of the composition on the amount of 
HIV budding in the system, and selecting a composition which causes a decrease in the amount of HIV 
budding present in the system because of an inhibition of the TSG101-GAG interaction relative to the 
20 system without the addition of the composition. 

150. Also disclosed are methods of inhibiting HIV budding comprising administering a 
composition that reduces an interaction between TSG101 and GAG. 

151. Disclosed are methods of making a composition capable of inhibiting HIV budding 
comprising admixing a compound with a pharmaceutically acceptable carrier, wherein the compound is 

25 identified by administering the compound to a system, wherein the system supports HIV budding via a 
TSG101-GAG interaction, assaying the effect of the compound on the amount of HIV budding in the 
system, and selecting a compound which causes a decrease in the amount of HIV budding in the system 
because of an inhibition of the TSG101-GAG interaction, relative to the system without the addition of 
the compound. 

30 152. Disclosed are methods of manufacturing an inhibitor to HIV budding comprising, a) 

administering a composition to a system, wherein the system supports HIV budding via a TSG101-GAG 
interaction, b) assaying the effect of the composition on the amount of HIV budding in the system, c) 
selecting a composition which causes a decrease in the amount of HIV budding present in the system 
because of an inhibition of the TSG101-GAG interaction, relative to the system with the addition of the 

35 composition, and d) synthesizing the composition. 

153. Also disclosed are methods comprising the step of admixing the composition with a 
pharmaceutical carrier. 
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154. Disclosed are methods of identifying an inhibitor of an interaction between TSG101 and 
GAG comprising a) administering a composition to a system, wherein the system comprises TSG 101, b) 
assaying the effect of the composition on a TSG101-GAG interaction, and c) selecting a composition 
which inhibits a TSG 1 0 1 -GAG interaction. 
5 155. Disclosed are methods, wherein the GAG comprises a P6 region, wherein the inhibitor 

disrupts an interaction between TSG101 and the P6 region of GAG, and/or wherein the GAG comprises a 
PTAP domain, wherein the inhibitor disrupts an interaction between TSG101 and the PTAP domain. 

156. Disclosed are cells comprising, a) a regulatable nucleic acid comprising sequence encoding 
a TSG101 gene and b) a nucleic acid comprising sequence encoding a GAG gene as well as cells 

10 comprising a regulatable nucleic acid comprising sequence encoding a TSG101 gene and sequence 
comprising a GAG gene. 

157. Disclosed are cells that further comprising an inhibitor of a TSG101-GAG interaction. 

7. TSG101-Vps28 interactions and effects 

158. Disclosed are methods for reducing interactions between TSG101 and Vps28, comprising 
15 incubating an inhibitor of the interaction between TSG101 and Vps28. Also disclosed are methods for 

inhibiting retroviral budding comprising administering an inhibitor of the interaction between TSG101 
and Vps28. 

159. Disclosed are methods of treating a subject comprising administering to the subject an 
inhibitor of HIV budding, wherein the inhibitor reduces the interaction between TSG101 and Vps28, and 

20 wherein the subject is in need of such treatment. 

160. Also disclosed are methods, wherein the inhibitor disrupts an interaction between TSG10I 
.and the Vps28, and/or wherein the inhibitor interacts with the UEV domain of TSG10L 

161. Disclosed are methods of identifying an inhibitor of an interaction between TSG101 and 
Vps28 comprising incubating a library of molecules with TSG101 forming a mixture, and identifying the 

25 molecules that disrupt the interaction between TSG101 and Vps28, wherein the interaction disrupted 
comprises an interaction between the Vps28 and an amino acid of TSG101. 

162. Also disclosed are methods, wherein the step of isolating comprises incubating the mixture 
with a molecule comprising Vps28. 

163. Disclosed are methods of identifying an inhibitor of an interaction between TSG101 and 
30 Vps28 comprising incubating a library of molecules with Vps28 forming a mixture, and identifying the 

molecules that disrupt the interaction between Vps28 and TSG 101, wherein the interaction disrupted 
comprises an interaction between the Vps28 and an amino acid of TSG101. 

164. Also disclosed are the methods, wherein the step of isolating comprises incubating the 
mixture with a molecule comprising a TSG 101 UEV domain. 

35 1 65 * AIs0 disclosed are compositions produced by any of the processes as disclosed herein, as 

well as compositions capable of being identified by the processes disclosed herein. 
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166. It is understood that the disclosed methods and compositions are active tor retroviruses 
dependent on GAG, such as HIV and Ebola. 

167. It is also understood that the disclosed methods and compositions can be further combined, 
with for example, PTAP domain addition. Disclosed herein, Vps28 and PTAP domains bind TSG101 
such that the binding of the two in the presence of each other is tight then the binding of either alone. 
Thus, disclosed are methods and assays where, for example, both PTAP domains and Vps28 are 
incubated with target molecules or used in systems to assays for molecules that inhibit retroviral budding. 

168. Disclosed are methods of manufacturing a composition for inhibiting the interaction 
between TSG101 and Vps28 comprising synthesizing the inhibitors as disclosed herein. 

169. Also disclosed are methods that include mixing a pharmaceutical carrier with the inhibitor 
as disclosed herein, and produced by any of the disclosed methods. 

170. Disclosed are methods of identifying inhibitors of TSG101-Vps28 interaction comprising, 
a) administering a composition to a system, wherein the system supports TSG101-Vps28 interaction, b) 
assaying the effect of the composition on the amount of TSG101-Vps28 is in the system, and c) selecting 
a composition which causes a decrease in the amount of TSG101-Vps28 present in the system relative to 
the system without the addition of the composition. 

171. Also disclosed are methods of identifying inhibitors of HIV budding comprising, a) 
administering a composition to a system, wherein the system supports HIV budding via a TSG101-Vps2S 
interaction, b) assaying the effect of the composition on the amount of HIV budding in the system, and c) 
selecting a composition which causes a decrease in the amount of HIV budding present in the system 
because of an inhibition of the TSG101-Vps28 interaction relative to the system without the addition of 
the composition. 

172. Disclosed are methods of inhibiting HIV budding comprising administering a composition, 
wherein the composition prevents HIV budding, wherein the composition is defined as a composition 
capable of being identified by administering the composition to a system, wherein the system supports 
HIV-budding via a TSG101-Vps28 interaction, assaying the effect of the composition on the amount of 
HIV budding in the system, and selecting a composition which causes a decrease in the amount of HIV 
budding present in the system because of an inhibition of the TSG101-Vps28 interaction relative to the 
system without the addition of the composition. 

173. Also disclosed are methods of inhibiting HIV budding comprising administering a 
composition that reduces an interaction between TSG101 and Vps28. 

174. Disclosed are methods of making a composition capable of inhibiting HIV budding 
comprising admixing a compound with a pharmaceutically acceptable carrier, wherein the compound is 
identified by administering the compound to a system, wherein the system supports HIV budding via a 
TSG101-Vps28 interaction, assaying the effect of the compound on the amount of HIV budding in the 
system, and selecting a compound which causes a decrease in the amount of HIV budding in the system 
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because of an inhibition of the TSG101-Vps28 interaction, relative to the system without the addition of 
the compound. 

175. Disclosed are methods of manufacturing an inhibitor to HIV budding comprising, a) 
administering a composition to a system, wherein the system supports HIV budding via a TSG101-Vps28 

5 interaction, b) assaying the effect of the composition on the amount of HIV budding in the system, c) 
selecting a composition which causes a decrease in the amount of HIV budding present in the system 
because of an inhibition of the TSG101-Vps28 interaction, relative to the system with the addition of the 
composition, and d) synthesizing the composition. 

176. Also disclosed are methods comprising the step of admixing the composition with a 
10 pharmaceutical carrier. 

177. Disclosed are methods of identifying an inhibitor of an interaction between TSG101 and 
Vps28 comprising a) administering a composition to a system, wherein the system comprises TSG101, b) 
assaying the effect of the composition on a TSG101-Vps28 interaction, and c) selecting a composition 
which inhibits a TSG101-Vps2S interaction. 

15 178. Disclosed are cells that further comprising an inhibitor of a TSG101-Vps28 interaction. 

179. TSG101-Vps37 interactions and effects 

180. Disclosed are methods for reducing interactions between TSG101 and Vps37, comprising 
incubating an inhibitor of the interaction between TSG101 and Vps37. Also disclosed are methods for 
inhibiting retroviral budding comprising administering an inhibitor of the interaction between TSG101 

20 and Vps37. 

181. Disclosed are methods of treating a subject comprising administering to the subject an 
inhibitor of HIV budding, wherein the inhibitor reduces the interaction between TSG101 and Vps37, and 
wherein the subject is in need of such treatment. 

182. Also disclosed are methods, wherein the inhibitor disrupts an interaction between TSG101 
25 and the Vps37, and/or wherein the inhibitor interacts with the UEV domain of TSG101 . 

183. Disclosed are methods of identifying an inhibitor of an interaction between TSG101 and 
Vps37 comprising incubating a library of molecules with TSG101 forming a mixture, and identifying the 
molecules that disrupt the interaction between TSG101 and Vps37, wherein the interaction disrupted 
comprises an interaction between the Vps37 and an amino acid of TSG10L 

30 184, Also disclosed are methods, wherein the step of isolating comprises incubating the mixture 

with a molecule comprising Vps37. 

185. Disclosed are methods of identifying an inhibitor of an interaction between TSG101 and 
Vps37 comprising incubating a library of molecules with Vps37 forming a mixture, and identifying the 
molecules that disrupt the interaction between Vps37 and TSG101, wherein the interaction disrupted 

35 comprises an interaction between the Vps37 and an amino acid of TSG101. 

186. Also disclosed are the methods, wherein the step of isolating comprises incubating the 
mixture with a molecule comprising a TSG 101 UEV domain. 
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187. Also disclosed are compositions produced by any of the processes as disclosed herein, as 
well as compositions capable of being identified by the processes disclosed herein. 

188. It is understood that the disclosed methods and compositions are active for retroviruses 
dependent on GAG, such as HIV and Ebola. 

5 1 89. It is also understood that the disclosed methods and compositions can be further combined, 

with for example, PTAP domain addition. Disclosed herein, Vps37 and PTAP domains bind TSG101 
such that the binding of the two in the presence of each other is tight then the binding of either alone. 
Thus, disclosed are methods and assays where, for example, both PTAP domains and Vps37 are 
incubated with target molecules or used in systems to assays for molecules that inhibit retroviral budding. 

10 1 90. Disclosed are methods of manufacturing a composition for inhibiting the interaction 

between TSG101 and Vps37 comprising synthesizing the inhibitors as disclosed herein. 

191 . Also disclosed are methods that include mixing a pharmaceutical carrier with the inhibitor 
as disclosed herein, and produced by any of the disclosed methods. 

192. Disclosed are methods of identifying inhibitors of TSG101-Vps37 interaction comprising, 
15 a) administering a composition to a system, wherein the system supports TSG101-Vps37 interaction, b) 

assaying the effect of the composition on the amount of TSG101-Vps37 is in the system, and c) selecting 
a composition which causes a decrease in the amount of TSG101-Vps37 present in the system relative to 
the system without the addition of the composition. 

193. Also disclosed are methods of identifying inhibitors of HIV budding comprising, a) 

20 administering a composition to a system, wherein the system supports HIV budding via a TSG101-Vps37 
interaction, b) assaying the effect of the composition on the amount of HIV budding in the system, and c) 
selecting a composition which causes a decrease in the amount of HIV budding present in the system 
because of an inhibition of the TSG101-Vps37 interaction relative to the system without the addition of 
the composition. 

25 194. Disclosed are methods of inhibiting HIV budding comprising administering a composition, 

wherein the composition prevents HIV budding, wherein the composition is defined as a composition 
capable of being identified by administering the composition to a system, wherein the system supports 
HIV-budding via a TSG101-Vps37 interaction, assaying the effect of the composition on the amount of 
HIV budding in the system, and selecting a composition which causes a decrease in the amount of HIV 

30 budding present in the system because of an inhibition of the TSG101~Vps37 interaction relative to the 
system without the addition of the composition. 

195. Also disclosed are methods of inhibiting HIV budding comprising administering a 
composition that reduces an interaction between TSG101 and Vps37. 

196. Disclosed are methods of making a composition capable of inhibiting HIV budding 

35 comprising admixing a compound with a pharmaceutical^ acceptable earner, wherein the compound is 
identified by administering the compound to a system, wherein the system supports HIV budding via a 
TSG101-Vps37 interaction, assaying the effect of the compound on the amount of HIV budding in the 
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system, and selecting a compound which causes a decrease in the amount of HIV budding in the system 
because of an inhibition of the TSG101-Vps37 interaction, relative to the system without the addition of 
the compound. 

197. Disclosed are methods of manufacturing an inhibitor to HIV budding comprising, a) 
5 administering a composition to a system, wherein the system supports HIV budding via a TSG101-Vps37 
interaction, b) assaying the effect of the composition on the amount of HIV budding in the system, c) 
selecting a composition which causes a decrease in the amount of HIV budding present in the system 
because of an inhibition of the TSG101-Vps37 interaction, relative to the system with the addition of the 
composition, and d) synthesizing the composition. 
10 198. Also disclosed are methods comprising the step of admixing the composition with a 

pharmaceutical carrier. 

199. Disclosed are methods of identifying an inhibitor of an interaction between TSG101 and 
Vps37 comprising a) administering a composition to a system, wherein the system comprises TSG101, b) 
assaying the effect of the composition on a TSG101-Vps37 interaction, and c) selecting a composition 

1 5 which inhibits a TSG 1 0 1 - Vps3 7 interaction. 

200. Disclosed are cells that further comprise an inhibitor of a TSG101-Vps37 interaction. 

8. Escrt 1 complex interactions and effects 

201. Disclosed are methods for reducing interactions within the Escrt 1 complex, comprising 
incubating an inhibitor of the interaction with Escrt 1 complex. Also disclosed are methods for inhibiting 

20 retroviral budding comprising administering an inhibitor of the interaction within the Escrt 1 complex. 

202. Disclosed are methods of treating a subject comprising administering to the subject an 
inhibitor of HIV budding, wherein the inhibitor reduces the interaction within the Escrt 1 complex, and 
wherein the subject is in need of such treatment. 

203. Also disclosed are methods, wherein the inhibitor disrupts an interaction within the Escrt 1 
25 complex, and/or wherein the inhibitor interacts with the UEV domain of TSG101, and/or S-box domain 

orf TSG101, and/or the PTAP domain of TSG10L 

204. Disclosed are methods of identifying an inhibitor of an interaction within the Escrt 1 
complex comprising incubating a library of molecules with Escrt 1 complex forming a mixture, and 
identifying the molecules that disrupt the interaction within the Escrt 1 complex, wherein the interaction 

30 disrupted comprises an interaction between an amino acid of a protein within the Escrt 1 complex. 

205. Also disclosed are methods, wherein the step of isolating comprises incubating the mixture 
with a molecule comprising a molecule comprising a protein within the Escrt 1 complex or the Escrt 1 
complex itself. 

206. Disclosed are methods of identifying an inhibitor of an interaction within the Escrt 1 

35 complex comprising incubating a library of molecules with the Escrt 1 complex forming a mixture, and 
identifying the molecules that disrupt the interaction within the Escrt 1 complex, wherein the interaction 
disrupted comprises an interaction with an amino acid of a protein within the Escrt 1 complex. 
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207. Also disclosed are the methods, wherein the step of isolating comprises incubating the 
mixture with a molecule comprising a TSG 101 UEV domain, and/or a TSG101-PTAP domain, and/or a 
TSGIOI-S-Box domain. 

208. Also disclosed are compositions produced by any of the processes as disclosed herein, as 
5 well as compositions capable of being identified by the processes disclosed herein. 

209. It is understood that the disclosed methods and compositions are active for retroviruses 
dependent on GAG, such as HIV and Ebola. 

210. It is also understood that the disclosed methods and compositions can be further combined, 
with ay of the disclosed methods and compositions herein. 

10 211. Disclosed are methods of manufacturing a composition for inhibiting the interaction within 

the Escrt 1 complex comprising synthesizing the inhibitors as disclosed herein. 

212. Also disclosed are methods that include mixing a pharmaceutical carrier with the inhibitor 
as disclosed herein, and produced by any of the disclosed methods. 

213. Disclosed are methods of identifying inhibitors of an interaction within a Escrt 1 complex 
15 comprising, a) administering a composition to a system, wherein the system supports Escrt 1 complex 

formation, b) assaying the effect of the composition on the amount of Escrt 1 complex formed in the 
system, and c) selecting a composition which causes a decrease in the amount of Escrt 1 complex present 
in the system relative to the system without the addition of the composition. 

214. Also disclosed are methods of identifying inhibitors of HIV budding comprising, a) 

20 administering a composition to a system, wherein the system supports HIV budding via Escrt 1 complex 
formation, b) assaying the effect of the composition on the amount of HIV budding in the system, and c) 
selecting a composition which causes a decrease in the amount of HIV budding present in the system 
because of an inhibition of the Escrt 1 complex relative to the system without the addition of the 
composition. 

25 215. Disclosed are methods of inhibiting HIV budding comprising administering a composition, 

wherein the composition prevents HIV budding, wherein the composition is defined as a composition 
capable of being identified by administering the composition to a system, wherein the system supports 
HIV-budding via a TSG101-Vps37 interaction, assaying the effect of the composition on the amount of 
HIV budding in the system, and selecting a composition which causes a decrease in the amount of HIV 

30 budding present in the system because of an inhibition of the TSG101-Vps37 interaction relative to the 
system without the addition of the composition. 

216. Also disclosed are methods of inhibiting HIV budding comprising administering a 
composition that reduces an interaction within the Escrt 1 complex. 

217. Disclosed are methods of making a composition capable of inhibiting HIV budding 

35 comprising admixing a compound with a pharmaceutical!/ acceptable carrier, wherein the compound is 
identified by administering the compound to a system, wherein the system supports HIV budding via a 
TSG101«Vps37 interaction, assaying the effect of the compound on the amount of HIV budding in the 
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system, and selecting a compound which causes a decrease in the amount of HIV budding in the system 
because of an inhibition of the TSG101-Vps37 interaction, relative to the system without the addition of 
the compound. 

218. Disclosed are methods of manufacturing an inhibitor to HIV budding comprising, a) 
5 administering a composition to a system, wherein the system supports HIV budding via an Escrt 1 
complex formation, b) assaying the effect of the composition on the amount of HIV budding in the 
system, c) selecting a composition which causes a decrease in the amount of HIV budding present in the 
system because of an inhibition of the Escrt 1 complex formation, relative to the system with the addition 
of the composition, and d) synthesizing the composition. 
10 219. Also disclosed are methods comprising the step of admixing the composition with a 

pharmaceutical carrier. 

220. Disclosed are methods of identifying an inhibitor of an interaction within the Escrt 1 
complex comprising a) administering a composition to a system, wherein the system comprises an Escrt 
1 complex, b) assaying the effect of the composition on the Escrt 1 complex, and c) selecting a 

15 composition which inhibits a Escrt I complex formation. 

221. Disclosed are cells that fiirther comprise an inhibitor of the Escrt complex formation. 

9. Downstream Escrt complex interactions and effects 

222. Disclosed are methods for reducing interactions within the Downstream Escrt complex, 
comprising incubating an inhibitor of the interaction with the Downstream Escrt complex. Also 

20 disclosed are methods for inhibiting retroviral budding comprising administering an inhibitor of the 
interaction within the Downstream Escrt complex. 

223. Disclosed are methods of treating a subject comprising administering to the subject an 
inhibitor of HIV budding, wherein the inhibitor reduces the interaction within the Downstream Escrt 
complex, and wherein the subject is in need of such treatment. 

25 224. Also disclosed are methods, wherein the inhibitor disrupts an interaction within the 

Downstream Escrt complex, and/or wherein the inhibitor interacts with the UEV domain of TSG101, 
and.or S-box domain orf TSG101, and/or the PTAP domain of TSG101. 

225. Disclosed are methods of identifying an inhibitor of an interaction within the Downstream 
Escrt complex comprising incubating a library of molecules with the Downstream Escrt complex 

30 forming a mixture, and identifying the molecules that disrupt the interaction within the Downstream Escrt 
complex, wherein the interaction disrupted comprises an interaction between an amino acid of a protein 
within the Downstream Escrt complex. 

226. Also disclosed are methods, wherein the step of isolating comprises incubating the mixture 
with a molecule comprising a molecule comprising a protein within the Downstream Escrt complex or 

35 the Downstream Escrt complex itself. 

227. Disclosed are methods of identifying an inhibitor of an interaction within the Downstream 
Escrt complex comprising incubating a library of molecules with the Downstream Escrt complex 
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forming a mixture, and identifying the molecules that disrupt the interaction within the Downstream Escrt 

complex, wherein the interaction disrupted comprises an interaction with an amino acid of a protein 

within the Downstream Escrt complex. 

228. Also disclosed are the methods, wherein the step of isolating comprises incubating the 

5 mixture with a molecule comprising a TSG 101 UEV domain, and/or a TSG101-PTAP domain, and/or a 
TSGIOI-S-Box domain. 

229. Also disclosed are compositions produced by any of the processes as disclosed herein, as 
well as compositions capable of being identified by the processes disclosed herein. 

230. It is understood that the disclosed methods and compositions are active for retroviruses 
10 dependent on GAG, such as HIV and Ebola. 

23 1. It is also understood that the disclosed methods and compositions can be further combined, 
with ay of the disclosed methods and compositions herein. 

232. Disclosed are methods of manufacturing a composition for inhibiting the interaction within 
the Downstream Escrt complex comprising synthesizing the inhibitors as disclosed herein. 

15 233. Also disclosed are methods that include mixing a pharmaceutical carrier with the inhibitor 

as disclosed herein, and produced by any of the disclosed methods. 

234. Disclosed are methods of identifying inhibitors of an interaction within a Downstream Escrt 
complex comprising, a) administering a composition to a system, wherein the system supports 
Downstream Escrt complex formation, b) assaying the effect of the composition on the amount of 

20 Downstream Escrt complex formed in the system, and c) selecting a composition which causes a 
decrease in die amount of Downstream Escrt complex present in the system relative to the system 
without the addition of the composition. 

235. Also disclosed are methods of identifying inhibitors of HIV budding comprising, a) 
administering a composition to a system, wherein the system supports HIV budding via Downstream 

25 Escrt complex formation, b) assaying the effect of the composition on the amount of HIV budding in the 
system, and c) selecting a composition which causes a decrease in the amount of HIV budding present in 
the system because of an inhibition of the Downstream Escrt complex relative to the system without the 
addition of the composition. 

236. Disclosed are methods of inhibiting HIV budding comprising administering a composition, 
30 wherein the composition prevents HIV budding, wherein the composition is defined as a composition 

capable of being identified by administering the composition to a system, wherein the system supports 
HIV-budding via a TSG101-Vps37 interaction, assaying the effect of the composition on the amount of 
HIV budding in the system, and selecting a composition which causes a decrease in the amount of HIV 
budding present in the system because of an inhibition of the TSG101-Vps37 interaction relative to the 
35 system without the addition of the composition. 

237. Also disclosed are methods of inhibiting HIV budding comprising administering a 
composition that reduces an interaction within the Downstream Escrt complex. 
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238. Disclosed are methods of making a composition capable of inhibiting HIV budding 

comprising admixing a compound with a pharmaceutical^ acceptable carrier, wherein the compound is 
identified by administering the compound to a system, wherein the system supports HIV budding via a 
TSG101-Vps37 interaction, assaying the effect of the compound on the amount of HIV budding in the 
system, and selecting a compound which causes a decrease in the amount of HIV budding in the system 
because of an inhibition of the TSG101-Vps37 interaction, relative to the system without the addition of 
the compound. 

239. Disclosed are methods of manufacturing an inhibitor to HIV budding comprising, a) 
administering a composition to a system, wherein the system supports HIV budding via an Downstream 
Escrt complex formation, b) assaying the effect of the composition on the amount of HIV budding in the 
system, c) selecting a composition which causes a decrease in the amount of HIV budding present in the 
system because of an inhibition of the Downstream Escrt complex formation, relative to the system with 
the addition of the composition, and d) synthesizing the composition. 

240. Also disclosed are methods comprising the step of admixing the composition with a 
15 pharmaceutical carrier. 

241. Disclosed are methods of identifying an inhibitor of an interaction within the Downstream 
Escrt complex comprising a) administering a composition to a system, wherein the system comprises an 
Downstream Escrt complex, b) assaying the effect of the composition on the Downstream Escrt 
complex, and c) selecting a composition which inhibits a Downstream Escrt complex formation. 

20 242. Disclosed are cells that further comprise an inhibitor of a Escrt complex formation. 

10. General Interactions 

243. Disclosed herein are a variety of molecules and a variety of compositions and a variety of 
methods. It is understood that the general principles disclosed by these methods can be applied to any 
molecule or set of molecules or composition or set of compositions disclosed herein. For example, the 

25 disclosed methods could be applied to TSG101-Vps4a or Vps4b interactions, or they could be applied to 
any interaction with Ub, such as a Ub-Vps4a interaction, a Ub-Vps4b interaction, a Ub-GAG interaction, 
as well as Vps28-Vps4a or Vps4b interactions, or any interaction between a protein or molecule in the 
Escrt 1 complex and any molecule in the downstream Escrt complex. It is also understood that a variety 
of functions exist for these molecules and compositions disclosed herein, such as binding, promotion of 

30 viral budding, promotion of GAG maturation, or ATPase function. There are number of assays disclosed 
herein related to these functions, including binding assays, protein production assays, and viral infectivity 
assays. Also it is understood that methods and compositions regarding, for example, Bc2-Ub, Bc2-GAG, 
Vsp4a~TSG101, Ub-GAG are disclosed. 
C. Compositions 

35 l. TSG101 

244. TSG101 comprises a number of domains. TSG101 can have a PTAP binding domain, 

which functions to interact with the PTAP region of a retroviral GAG protein. It is understood that the 

TSG10I PTAP binding domain can bind any protein containing the PTAP region. This region is a part of 
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retroviral GAG proteins as well as proteins in enveloped viruses TSG101 also can have ubiquitin binding 
domain, which can interact with ubiquitin. 

245. The direct interaction between TsglOl and the PTAP late domain motifs on the viral 
structural proteins is mediated by the N-terminal ubiquitin E2 variant (UEV) domain of TsglOl 
5 (VerPlank et al., 2001 ; Garrus et al., 2001; Martin-Serrano et al., 2001), whereas the remaining C- 

terminal two-thirds of TsglOl interacts with other proteins (Fig. la). Like other UEV domains, TsglOl 
UEV shows significant sequence similarity to E2 ubiquitin ligases but is unable to catalyze ubiquitin 
transfer, as it lacks the active site cysteine that forms the transient thioester bond with the C-terminus of 
ubiquitin (Ub) (Koonin and Abagyan, 1997; Ponting et al., 1997) (Fig. 16). Nevertheless, at least some 

10 UEVs have retained the ability to bind Ub, and appear to act either as co-factors in ubiquitination 

reactions, or as ubiquitin sensors (Hofmann and Pickart, 1999; VerPlank et al., 2001; Katzmann et al., 
2001; VanDemark et al., 2001 ; Garrus et al, 2001). UEV domains also frequently contain other protein 
recognition motifs, and may generally serve to couple protein and Ub binding functions to facilitate the 
formation of multiprotein complexes. 

15 246. TsglOl normally functions in the cellular vacuolar protein sorting (Vps) pathway, which 

coordinates the sorting of membrane-associated proteins through a series of endosomal compartments for 
eventual delivery to the lysosome (vacuole in yeast) (Dupre et al., 2001 ; Hicke, 2001; Piper and Luzio, 
2001). A key decision-making step in this pathway occurs when membrane patches containing proteins 
destined for destruction bud as small vesicles into the lumen of the late endosome, creating an organelle 

20 called the "multivesicular body" (MVB). Subsequent fusion of MVB with the lysosome delivers these 
vesicles and all associated cargo to the lumen of the lysosome, where they are degraded by proteolysis. 
In contrast, proteins that remain in die limiting membrane of MVB are delivered to the limiting 
membrane of lysosomes and therefore escape degradation. 

247. Work, particularly in the yeast system, has revealed that TgslOl (yeast Vps23p) performs 

25 an important role in selecting which proteins enter the MVB lumen and which remain on the limiting 
membrane. In yeast, proteins destined for the lumen of the vacuole are covalently modified with 
monoubiquitin, and bound by a 350-kDa protein complex called ESCRT-I* which is composed of 
Vps23p, Vps28p and Vps37p (Katzmann et al., 2001). ESCRT-I appears to function as the receptor 
and/or sorting complex that selects ubiquitinated proteins for incorporation into MVB vesicles. This 

30 sorting event appears absolutely dependent on the Ub-binding activity of Vps23p, as adding an 

ubiquitination signal targets proteins that would otherwise remain on the limiting membrane into the 
lumen. Conversely, a point mutation in Vps23p that abolishes Ub binding also blocks sorting of 
ubiquitinated cargo proteins (Katzmann et al., 2001). Although the human system is less well 
characterized, TsglOl also binds Ub in vitro (Garrus et al., 2001), forms a soluble 350-kDa complex that 

35 includes Vps28 (Babst et ah, 2000; Bishop and Woodman, 2001), and is required for the delivery of 
cathepsin D and endocytosed receptors to the lysosome (Babst et al, 2000). Thus, it appears that 
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ESCRT-I complex function is conserved from yeast to humans, and that recognition of ubiquitinated 
proteins is an important element in this function. 

248. TsglOl was initially discovered in a screen for potential tumor suppressors (Li and Cohen, 
1996), and the protein appears to perform multiple functions, including down-regulating p53 via the 
MDM2/p53 pathway (Li et al., 2001; Ruland et al., 2001). TsglOl also plays a central role in vacuolar 
protein sorting. The Vps pathway sorts membrane-bound proteins for eventual degradation in the 
lysosome (vacuole in yeast) ((Lemmon and Traub, 2000) and references therein). Two alternative 
entrees into the Vps pathway are via vesicular trafficking from the Golgi (e.g., in destroying misfolded 
membrane proteins) or via endocytosis from the plasma membrane (e.g., in downregulating surface 
receptors). Vesicles carrying proteins from either source can enter the Vps pathway by fusing with 
endosomes. As these endosomes mature, their cargos are sorted for lysosomal degradation via the 
formation of structures called multivesicular bodies (MVB). MVB are created when surface patches on 
late endosomes bud into the lumen, forming small (-50-100 nm) vesicles. A maturing MVB can contain 
tens or even hundreds of these vesicles. The MVB then fuses with the lysosome, releasing the vesicles 
for degradation in this hydrolytic organelle. 

249. Ub also plays an important role in the budding of retroviruses and other enveloped viruses 
(Harty et al., 2000; Patnaik et al., 2000; Schubert et al., 2000; Strack et al., 2000; Vogt, 2000; Harty et al., 
2001; Kikonyogo et al., 2001). Retroviruses contain high levels of ubiquitin, and 2-5% of Gag proteins 
in the virion are monoubiquitinated (Putterman et al., 1990; Ott et al., 1998; Ott et al., 2000). Moreover, 
treatment of infected cells with proteasome inhibitors, which decreases the intracellular concentration of 
free Ub, also inhibits virus release at a late stage (Schubert et al., 2000; Harty et al., 2001). In some cases 
this effect can be partially rescued either by overexpressing free Ub or by fusing Ub to the C-terminal 
end of the viral Gag protein (Patnaik et al., 2000). 

2. HIV budding 

250. Like other enveloped viruses, HIV-1 uses cellular machinery to bud from infected cells. 
Disclosed herein TsglOl protein functions in vacuolar protein sorting (Vps) and is required for HIV-1 
budding. The UEV domain of TsglOl binds to an essential tetrapeptide (PTAP) motif within the p6 
domain of the structural Gag protein and also to ubiquitin. Disclosed herein, depletion of cellular TsglOl 
by a small interfering RNA arrests HIV-1 budding at a late stage, and budding is rescued by 
reintroduction of TsglOl. Also disclosed are dominant negative mutant Vps4 proteins, as well as other 
molecules, that inhibit vacuolar protein sorting also arrest HIV-1 and MLV budding. These observations 
are consistent with retroviruses budding by appropriating cellular machinery normally used in the Vps 
pathway to form multivesicular bodies. 

251. HIV-1 assembly is driven by the viral Gag protein, which is actively trafficked to the 
plasma membrane where it associates into enveloped, spherical particles that bud from the cell (reviewed 
in (Freed, 1998)). During viral assembly, Gag is processed by the viral protease at a series of sites to 
produce four new structural proteins that perform essential functions in the mature, infectious virion 
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(denoted MA, CA, NC, and p6; Fig. 1 A). Proteolytic processing is not required for particle production, 
however, as HIV-1 Gag can assemble and bud in the absence of any other viral proteins. 

252. The budding of an enveloped virus can be viewed as a fission event in which the continuous 
cell membrane is broken and resealed to create discrete viral and cellular membranes. Like other 

5 enveloped viruses, HIV-1 does not encode its own membrane fission machinery and presumably 
therefore must recruit and reprogram cellular proteins to assist in the budding process. A potential 
docking site for such cellular factor(s) has been mapped to a conserved P(T/S)AP motif located in the p6 
domain of HIV-1 Gag. Point mutations within this "PTAP" motif arrest viral release at a very late stage 
(Gottlinger et al., 1991; Huang et aL, 1995), and it has therefore been termed a "late domain" (Wills and 
1 0 Craven, 1991). Virus assembly appears to initiate normally in the late domain mutants, but the 

continuous membrane that connects the budding particles to the cell (or to other budding particles) is not 
severed, resulting in abnormal or severely attenuated virus release. 

253. The recruitment of cellular machinery to facilitate virus budding seems to be a general 
phenomenon, and distinct late domains have been identified in the structural proteins of many enveloped 

15 viruses (Vogt, 2000). Two well characterized late domains are the "PY" motif (consensus sequence: 
PPXY; X= any amino acid) found in membrane-associated proteins from filo-, orbK rhabdo- and 
oncoretroviruses (Craven et al., 1999; Harty et al-, 2000; Harty et al., 1999; Jayakar et al., 2000), and the 
"YL" motif (YXXL) found in the Gag protein of equine infectious anemia virus (EIAV) (Puffer et al., 
1997; Puffer et al., 1998). The various late domains can still function when moved to different positions 

20 within retroviral Gag proteins, supporting the idea that they are docking sites for cellular factors rather 

than structural elements (Parent et al., 1995; Yuan et al., 2000). Moreover, the different late domains can 
function interchangeably and multiple late domains are often found in close proximity within viral coat 
proteins, suggesting that they may act synergistically (Parent et al., 1995; Strack et al., 2000; Yuan et al., 
2000). 

25 254. Ubiquitin (Ub) also plays an essential, albeit poorly understood, role in retroviral budding 

(Patnaik et al., 2000; Schubert et aL, 2000b; Strack et al., 2000; Vogt, 2000). For example, retrovirus 
budding can be blocked at a late stage by depleting cellular pools of free ubiquitin with proteasome 
inhibitors. It is not yet certain, however, whether the functionally relevant substrate for ubiquitination is 
Gag itself or a cellular factor. HIV-1 p6 and other retroviral Gag proteins are monoubiquitinated at low 

30 levels (Ott et al., 2000) and there is a general correlation between Gag ubiquitination and virus release 
(Schubert et al., 2000b; Strack et aL, 2000). Moreover, the block imposed by proteasome inhibitors can 
be at least partially alleviated by covalently fusing Ub to the C-terminal end of Rous sarcoma virus 
(RSV) Gag (Patnaik et aL, 2000). However, virus release and replication are not affected by mutation of 
the two HIV-1 p6 lysine residues (Lys-27 and Lys-33) that are the major sites for Gag ubiquitination (Ott 

35 et aL, 2000). The functional role of Gag ubiquitination therefore remains uncertain. In principle, Gag 
ubiquitination could facilitate budding either by targeting defective Gag molecules for proteolytic 
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degradation and thereby preventing them from interfering with viral budding (Schubert et ah, 2000a), or 
by creating docking sites for cellular factors that actively participate in viral budding. 

3. Vps4 

255. The yeast Vps4p is now known to have two 48/49kD, 80% identical orthologs in humans, 

5 Vps4a and Skd 1 (Vps4b). Vps4a and Vps4b belong to a family of ATPases with more than 200 members 
with a conserved 200 amino acid catalytic domain called AAA (ATPases associated with various cellular 
activities). These domains are involved in a variety of functions, include membrane trafficking, 
proteasome function, the MT motor dynein, and NSF which refolds and resets SNARE complexes after 
membrane fusion. Most AAA ATPases form hexameric or other oligomeric rings; and Vps4 is thought 

1 0 to catalyze the release of membrane-bound proteins from the MVB. Vps4a andVps4b bind CHMP 1 

(CHarged MVB Proteins) and BC2/CHMP2 proteins. (See Babst and Emr, EMBO J. 1997, 16(8):1820-3L, 
EMBO J. 1998, 17(1 l):2982-93., 1998; Bishop & Woodman, Mol Biol Cell. 2000; 1 l(l):227-39., J Biol 
Chem. 2001 Apr 13;276(15):1 1735-42.; and Howard et ak, J. Cell Sci 2001;114(Pt 13):2395-404., all of 
which are herein incorporated by reference for material at least related to Vps4, Bc2, and TSG101, and 

1 5 other protein complexes) 

256. Highly conserved lysine to glutamine and glutamate to glutamine mutations block exit of 
proteins from the vacuole or MVB in yeast, mouse and human cells, and thus block protein sorting in a 
dominant negative way. This blockage causes the MVB(or vacuole in yeast) to keep accumulating 
vesicles inside, which causes the organelles to swell and grow. The mutants affect ATP-binding and are 

20 K173Q, K180Q; and ATP-hydrolysis E228Q, E235Q. 

4. Molecules that can interact with TSG101 and/or PTAP and/or Ub, and/or Vps4A, 
and/or Vps4B or Vps28 or Vps37 or Bc2 or Ubiquitin or GAG. 

a) Functional Nucleic Acids 

257. Functional nucleic acids are nucleic acid molecules that have a specific function, such as 
25 binding a target molecule or catalyzing a specific reaction. Functional nucleic acid molecules can be 

divided into the following categories, which are not meant to be limiting. For example, functional 
nucleic acids include antisense molecules, aptamers, ribozymes, triplex forming molecules, and external 
guide sequences. The functional nucleic acid molecules can act as affectors, inhibitors, modulators, and 
stimulators of a specific activity possessed by a target molecule, or the functional nucleic acid molecules 
30 can possess a de novo activity independent of any other molecules. 

258. Functional nucleic acid molecules can interact with any macromolecule, such as DNA, 
RNA, polypeptides, or carbohydrate chains. Thus, functional nucleic acids can interact with the mRNA 
of any of the proteins and compositions disclosed herein, such as, TSG101 or Vps4A or Vps4B or 
fragment thereof, or the genomic DNA of any of the proteins and compositions disclosed herein, such as, 

35 TSG101 or Vps4A or Vps4B or fragment thereof or they can interact with any of the proteins and 

compositions disclosed herein, such as, the polypeptide TSG1 01 or Vps4A or Vps4B or fragment thereof, 

such as the PTAP binding domain or the UB binding domain. Often functional nucleic acids are 

designed to interact with other nucleic acids based on sequence homology between the target molecule 
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and the functional nucleic acid molecule. In other situations, the specific recognition between the 
functional nucleic acid molecule and the target molecule is not based on sequence homology between the 
functional nucleic acid molecule and the target molecule, but rather is based on the formation of tertiary 
structure that allows specific recognition to take place. 
5 259. Antisense molecules are designed to interact with a target nucleic acid molecule through 

either canonical or non-canonical base pairing. The interaction of the antisense molecule and the target 
molecule is designed to promote the destruction of the target molecule through, for example, RNAseH 
mediated RNA-DNA hybrid degradation. Alternatively the antisense molecule is designed to interrupt a 
processing function that normally would take place on the target molecule, such as transcription or 
10 replication. Antisense molecules can be designed based on the sequence of the target molecule. 

Numerous methods for optimization of antisense efficiency by finding the most accessible regions of the 
target molecule exist. Exemplary methods would be in vitro selection experiments and DNA 
modification studies using DMS and DEPC. It is preferred that antisense molecules bind the target 
molecule with a dissociation constant (k d )less than 10" c , It is more preferred that antisense molecules 
15 bind with a k d less than 10" 8 . It is also more preferred that the antisense molecules bind the target 
molecule with a k d less than 10" 10 . It is also preferred that the antisense molecules bind the target 
molecule with a ka less than 10" 12 . A representative sample of methods and techniques which aid in. the 
design and use of antisense molecules can be found in the following non-limiting list of United States 
patents: 5,135,917, 5,294,533, 5,627,158, 5,641,754, 5,691,317, 5,780,607, 5,786,138, 5,849,903, 

20 5,856,103, 5,919,772, 5,955,590, 5,990,088, 5,994,320, 5,998,602, 6,005,095, 6,007,995, 6,013,522, 
6,017,898, 6,018,042, 6,025,198, 6,033,910, 6,040,296, 6,046,004, 6,046,319, and 6,057,437. 

260. Aptamers are molecules that interact with a target molecule, preferably in a specific way. 
Typically aptamers are small nucleic acids ranging from 15-50 bases in length that fold into defined 
secondary and tertiary structures, such as stem-loops or G-quartets. Aptamers can bind small molecules, 

25 such as ATP (United States patent 5,63 1 , 146) and theophiline (United States patent 5,580,737), as well as 
large molecules, such as reverse transcriptase (United States patent 5,786,462) and thrombin (United 
States patent 5,543,293). Aptamers can bind very tightly with kdS from the target molecule of less than 
10~ 12 M. It is preferred that the aptamers bind the target molecule with a kd less than 10~ 6 . It is more 
preferred that the aptamers bind the target molecule with a k d less than 10' 8 . It is also more preferred that 

30 the aptamers bind the target molecule with a k d less than 10" 10 . It is also preferred that the aptamers bind 
the target molecule with a kd less than 10" 12 . Aptamers can bind the target molecule with a very high 
degree of specificity. For example, aptamers have been isolated that have greater than a 10000 fold 
difference in binding affinities between the target molecule and another molecule that differ at only a 
single position on the molecule (United States patent 5,543,293). It is preferred that the aptamer have a 

35 k d with the target molecule at least 10 fold lower than the k d with a background binding molecule. It is 
more preferred that the aptamer have a kd with the target molecule at least 100 fold lower than the k<, with 
a background binding molecule. It is more preferred that the aptamer have a k d with the target molecule 
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at least 1000 fold lower than the ka with a background binding molecule. It is preferred that the aptamer 
have a k d with the target molecule at least 10000 fold lower than the k<j with a background binding 
molecule. It is preferred when doing the comparison for a polypeptide for example, that the background 
molecule be a different polypeptide. For example, when determining the specificity of TSG101 or 
5 Vps4A or Vps4B aptamers, such as TSG101 PTAP binding domain or UB binding domain aptamers, the 
background protein could be albumin. Representative examples of how to make and use aptamers to 
bind a variety of different target molecules can be found in the following non-limiting list of United 
States patents: 5,476,766, 5,503,978, 5,631,146, 5,731,424 , 5,780,228, 5,792,613, 5,795,721, 5,846,713, 
5,858,660 , 5,861,254, 5,864,026, 5,869,641, 5,958,691, 6,001,988, 6,01 1,020, 6,013,443, 6,020,130, 
1 0 6,028, 1 86, 6,030,776, and 6,05 1 ,698 . 

261. Ribozymes are nucleic acid molecules that are capable of catalyzing a chemical reaction, 
either intramolecularly or intermolecularly. Ribozymes are thus catalytic nucleic acid. It is preferred that 
the ribozymes catalyze intermolecular reactions. There are a number of different types of ribozymes that 
catalyze nuclease or nucleic acid polymerase type reactions which are based on ribozymes found in 

15 natural systems, such as hammerhead ribozymes, (for example, but not limited to the following United 
States patents: 5,334,711, 5,436,330, 5,616,466, 5,633,133, 5,646,020, 5,652,094, 5,712,384, 5,770,715, 
5,856,463, 5,861,288, 5,891,683, 5,891,684, 5,985,621, 5,989,908, 5,998,193, 5,998,203, WO-9858058 
by Ludwig and Sproat, WO 9858057 by Ludwig and Sproat, and WO 971 83 12 by Ludwig and Sproat) 
hairpin ribozymes (for example, but not limited to the following United States patents: 5,631,1 15, 

20 5,646,031, 5,683,902, 5,712,384, 5,856,188, 5,866,701, 5,869,339, and 6,022,962), and tetrahymena 

ribozymes (for example, but not limited to the following United States patents: 5,595,873 and 5,652,107). 
There are also a number of ribozymes that are not found in natural systems, but which have been 
engineered to catalyze specific reactions de novo (for example, but not limited to the following United 
States patents: 5,580,967, 5,688,670, 5,807,718, and 5,910,408). Preferred ribozymes cleave RNA or 

25 DNA substrates, and more preferably cleave RNA substrates. Ribozymes typically cleave nucleic acid 
substrates through recognition and binding of the target substrate with subsequent cleavage. This 
recognition is often based mostly on canonical or non-canonical base pair interactions. This property 
makes ribozymes particularly good candidates for target specific cleavage of nucleic acids because 
recognition of the target substrate is based on the target substrates sequence. Representative examples of 

30 how to make and use ribozymes to catalyze a variety of different reactions can be found in the following 
non-limiting list of United States patents: 5,646,042, 5,693,535, 5,731,295, 5,81 1,300, 5,837,855, 
5,869,253, 5,877,021, 5,877,022, 5,972,699, 5,972,704, 5,989,906, and 6,017,756. 

262. Triplex forming functional nucleic acid molecules are molecules that can interact with either 
double-stranded or single-stranded nucleic acid. When triplex molecules interact with a target region, a 

35 structure called a triplex is formed, in which there are three strands of DNA forming a complex 

dependant on both Watson-Crick and Hoogsteen base-pairing. Triplex molecules are preferred because 
they can bind target regions with high affinity and specificity. It is preferred that the triplex forming 
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molecules bind the target molecule with a k d less than 10" 6 . It is more preferred that the triplex forming 
molecules bind with a k d less than 10~ 8 . It is also more preferred that the triplex forming molecules bind 
the target molecule with a k<j less than 10" 10 . It is also preferred that the triplex forming molecules bind 
the target molecule with a k d less than 10' 12 . Representative examples of how to make and use triplex 
5 forming molecules to bind a variety of different target molecules can be found in the following non- 
limiting list of United States patents: 5,176,996, 5,645,985, 5,650,316, 5,683,874, 5,693,773, 5,834,185, 
5,869,246, 5,874,566, and 5,962,426. 

263. External guide sequences (EGSs) are molecules that bind a target nucleic acid molecule 
forming a complex, and this complex is recognized by RNase P, which cleaves the target molecule. 

10 EGSs can be designed to specifically target a RNA molecule of choice. RNAse P aids in processing 
transfer RNA (tRNA) within a cell. Bacterial RNAse P can be recruited to cleave virtually any RNA 
sequence by using an EGS that causes the target RNA:EGS complex to mimic the natural tRNA 
substrate. (WO 92/03566 by Yale, and Forster and Altman, Science 238:407-409 (1990)). 

264. Similarly, eukaryotic EGS/RNAse P-directed cleavage of RNA can be utilized to cleave 

1 5 desired targets within eukaryotic cells. (Yuan et al., Proc. Natl. Acad. Sci. USA 89:8006-8010 (1992); 
WO 93/22434 by Yale; WO 95/24489 by Yale; Yuan and Altman, EMBO J 14:159-168 (1995), and 
Carrara et al.. Proc. Natl. Acad. Sci. (USA) 92:2627-2631 (1995)). Representative examples of how to 
make and use EGS molecules to facilitate cleavage of a variety of different target molecules be found in 
the following non-limiting list of United States patents: 5,168,053, 5,624,824, 5,683,873, 5,728,521, 
20 5,869,248, and 5,877,162 

b) Antibodies 

(1) Antibodies Generally 

265. The term "antibodies" is used herein in a broad sense and includes both polyclonal and 
monoclonal antibodies. In addition to intact immunoglobulin molecules, also included in the term 

25 "antibodies" are fragments or polymers of those immunoglobulin molecules, and human or humanized 

versions of immunoglobulin molecules or fragments thereof, as long as they are chosen for their ability to 
interact with any of the proteins and compositions disclosed herein, such as, TSG101 or Vps4A or 
Vps4B or GAG, such that TSG101 or Vps4A or Vps4B or GAG is inhibited from promoting retroviral 
budding. Antibodies that bind the disclosed regions of any of the proteins and compositions disclosed 

30 herein, such as, TSG101 or Vps4A or Vps4B or GAG involved in the viral budding are also disclosed. 
The antibodies can be tested for their desired activity using the in vitro assays described herein, or by 
analogous methods, after which their in vivo therapeutic and/or prophylactic activities are tested 
according to known clinical testing methods. Also disclosed are functional equivalents of antibodies. 

266. The term "monoclonal antibody" as used herein refers to an antibody obtained from a 

35 substantially homogeneous population of antibodies, i.e., the individual antibodies within the population 
are identical except for possible naturally occurring mutations that may be present in a small subset of the 
antibody molecules. The monoclonal antibodies herein specifically include "chimeric" antibodies in 
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which a portion of the heavy and/or light chain is identical with or homologous to corresponding 
sequences in antibodies derived from a particular species or belonging to a particular antibody class or 
subclass, while the remainder of the chain(s) is identical with or homologous to corresponding sequences 
in antibodies derived from another species or belonging to another antibody class or subclass, as well as 
fragments of such antibodies, as long as they exhibit the desired antagonistic activity (See, U.S. Pat. No. 
4,816,567 and Morrison etal., Proc. Natl. Acad. Sci. USA, 81:6851-6855 (1984)). 

267. The disclosed monoclonal antibodies can be made using any procedure which produces 
mono clonal antibodies. For example, monoclonal antibodies of the invention can be prepared using 
hybridoma methods, such as those described by Kohler and Milstein, Nature, 256:495 (1975). In a 
hybridoma method, a mouse or other appropriate host animal is typically immunized with an immunizing 
agent to elicit lymphocytes that produce or are capable of producing antibodies that will specifically bind 
to the immunizing agent. 

268. The monoclonal antibodies may also be made by recombinant DNA methods, such as those 
described in U.S. Pat. No. 4,816,567 (Cabilly et al.). DNA encoding the disclosed monoclonal antibodies 
can be readily isolated and sequenced using conventional procedures (e.g., by using oligonucleotide 
probes that are capable of binding specifically to genes encoding the heavy and light chains of murine 
antibodies). Libraries of antibodies or active antibody fragments can also be generated and screened 
using phage display techniques, e.g., as described in U.S. Patent No. 5,804,440 to Burton et al. and U.S. 
Patent No. 6,096,441 to Barbas et al. 

269. In vitro methods are also suitable for preparing monovalent antibodies. Digestion of 
antibodies to produce fragments thereof, particularly, Fab fragments, can be accomplished using routine 
techniques known in the art. For instance, digestion can be performed using papain. Examples of papain 
digestion are described in WO 94/29348 published Dec. 22, 1994 and U.S. Pat. No. 4,342,566. Papain 
digestion of antibodies typically produces two identical antigen binding fragments, called Fab fragments, 
each with a single antigen binding site, and a residual Fc fragment. Pepsin treatment yields a fragment 
that has two antigen combining sites and is still capable of cross-linking antigen. 

270. The fragments, whether attached to other sequences or not, can also include insertions, 
deletions, substitutions, or other selected modifications of particular regions or specific amino acids 
residues, provided the activity of the antibody or antibody fragment is not significantly altered or 

30 impaired compared to the non-modified antibody or antibody fragment. These modifications can provide 
for some additional property, such as to remove/add amino acids capable of disulfide bonding, to increase 
its bio-longevity, to alter its secretory characteristics, etc. In any case, the antibody or antibody fragment 
must possess a bioactive property, such as specific binding to its cognate antigen. Functional or active 
regions of the antibody or antibody fragment may be identified by mutagenesis of a specific region of the 
protein, followed by expression and testing of the expressed polypeptide. Such methods are readily 
apparent to a skilled practitioner in the art and can include site-specific mutagenesis of the nucleic acid 
encoding the antibody or antibody fragment. (Zoller, M.J. Curr. Opin. Biotechnol. 3:348-354, 1992). 
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271. As used herein, the term "antibody" or "antibodies" can also refer to a human antibody 
and/or a humanized antibody. Many non-human antibodies (e.g., those derived from mice, rats, or 
rabbits) are naturally antigenic in humans, and thus can give rise to undesirable immune responses when 
administered to humans. Therefore, the use of human or humanized antibodies in the methods of the 

5 invention serves to lessen the chance that an antibody administered to a human will evoke an undesirable 
immune response. 

(2) Human antibodies 

272. The human antibodies of the invention can be prepared using any technique. Examples of 
techniques for human monoclonal antibody production include those described by Cole et al, 

10 {Monoclonal Antibodies and Cancer Therapy, Alan R. Liss, p. 77, 1985) and by Boerner et al. {J. 

Immunol., 147(l):86-95, 1991). Human antibodies of the invention (and fragments thereof) can also be 
produced using phage display libraries (Hoogenboom et al.,/. Mot. Biol, 227:381, 1991; Marks et al.,7. 
Mol Biol, 222:581, 1991). 

273. The human antibodies of the invention can also be obtained from transgenic animals. For 
1 5 example, transgenic, mutant mice that are capable of producing a full repertoire of human antibodies, in 

response to immunization, have been described (see, e.g., Jakobovits et al., Proc. Natl Acad. Set USA, 
90:2551-255 (1993); Jakobovits et al., Nature, 362:255-258 (1993); Bruggermann et ah, Year, in 
Immunol., 7:33 (1993)). Specifically, the homozygous deletion of the antibody heavy chain joining 
region (J(HJ) gene in these chimeric and germ-line mutant mice results in complete inhibition of 
20 endogenous antibody production, and the successful transfer of the human germ-line antibody gene array 
into such germ-line mutant mice results in the production of human antibodies upon antigen challenge. 

(3) Humanized antibodies 

274. Antibody humanization techniques generally involve the use of recombinant DNA 
technology to manipulate the DNA sequence encoding one or more polypeptide chains of an antibody 

25 molecule. Accordingly, a humanized form of a non-human antibody (or a fragment thereof) is a chimeric 
antibody or antibody chain (or a fragment thereof, such as an Fv, Fab, Fab', or other antigen-binding 
portion of an antibody) which contains a portion of an antigen binding site from a non-human (donor) 
antibody integrated into the framework of a human (recipient) antibody. 

275. To generate a humanized antibody, residues from one or more complementarity determining 
30 regions (CDRs) of a recipient (human) antibody molecule are replaced by residues from one or more 

CDRs of a donor (non-human) antibody molecule that is known to have desired antigen binding 
characteristics (e.g., a certain level of specificity and affinity for the target antigen). In some instances, 
Fv framework (FR) residues of the human antibody are replaced by corresponding non-human residues. 
Humanized antibodies may also contain residues which'are found neither in the recipient antibody nor in 
35 the imported CDR or framework sequences. Generally, a humanized antibody has one or more amino 

acid residues introduced into it from a source which is non-human. In practice, humanized antibodies are 
typically human antibodies in which some CDR residues and possibly some FR residues are substituted 
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by residues from analogous sites in rodent antibodies. Humanized antibodies generally contain at least a 

portion of an antibody constant region (Fc), typically that of a human antibody (Jones et ah, Nature, 

321:522-525 (1986), Reichmann et aL, Nature, 332:323-327 (1988), and Presta, Curr. Opin. Struct Biol, 

2:593-596 (1992)). 

5 276. Methods for humanizing non-human antibodies are well known in the art. For example, 

humanized antibodies can be generated according to the methods of Winter and co-workers (Jones et ah, 
Nature, 321:522-525 (1986), Riechmann et ah, Nature, 332:323-327 (1988), Verhoeyen et aL, Science, 
239:1534-1536 (1988)), by substituting rodent CDRs or CDR sequences for the corresponding sequences 
of a human antibody. Methods that can be used to produce humanized antibodies are also described in 

10 U.S. Patent No. 4,8 16,567 (Cabilly et al.), U.S. Patent No. 5,565,332 (Hoogenboom et aL), U.S. Patent 
No. 5,721 ,367 (Kay et aL), U.S. Patent No. 5,837,243 (Deo et aL), U.S. Patent No. 5, 939,598 
(Kucherlapati et aL), U.S. Patent No. 6,130,364 (Jakobovits et aL), and U.S. Patent No. 6,180,377 
(Morgan et aL). 

(4) Administration of antibodies 

15 277. Administration of the antibodies can be done as disclosed herein, as the delivery of any non- 

nucleic acid composition or pharmaceutical. Nucleic acid approaches for antibody delivery also exist. 
The broadly neutralizing anti TSG101 or Vps4A or Vps4B or GAG antibodies and antibody fragments of 
the invention can also be administered to patients or subjects as a nucleic acid preparation (e.g., DNA or 
RNA) that encodes the antibody or antibody fragment, such that the patient's or subject's own cells take 

20 up the nucleic acid and produce and secrete the encoded antibody or antibody fragment. The delivery of 
the nucleic acid can be by any means, as disclosed herein, for example. 

c) Molecules that impair ATPase function of Vps4A and Vps4B 
278. Disclosed are variants of Vps4A and Vps4B that have impaired ATPase activity or which 
themselves act as negative regulators of ATPase activity of Vps4A and Vps4B. These molecules when 

25 administered to cells harboring a virus such as HIV or ebola can inhibit the budding of the virus. Thus, 
molecules or mutants that inhibit ATPase activity are preferred and disclosed. One example of such a 
mutant is the ATPase-defective GFP-Vps4 fusion proteins induces formation of enlarged endosomes 
(called "class E" compartments in yeast) that are defective in the sorting and recycling of endocytosed 
substrates (Bishop and Woodman, 2000) (which are herein incorporated by reference for material at least 

30 related to Vps4 and mutants thereof). Other mutants, for example, are two different dominant negative 
mutations: K173Q, which blocks ATP binding (Vps4 K i73a), and E228Q, which blocks ATP hydrolysis 
(Vps4 E 228Q) at the positions noted in Vps4. This shows that ATPase activity of Vps4 is required for 
retroviral, such as HIV budding, and that any mutation or molecule that inhibits Vps4 ATPase activity 
will function as and is disclosed herein. Mutants such as this exist for both Vps4A and Vps4B. 
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d) Compositions identified by screening with disclosed compositions / 
combinatorial chemistry or by using the structure information 
(1) Combinatorial chemistry 

279. The disclosed compositions can be used as targets for any combinatorial technique to 
5 identify molecules or macromolecular molecules that interact with the disclosed compositions in a 

desired way. The nucleic acids, peptides, and related molecules disclosed herein can be used as targets 
for the combinatorial approaches. Also disclosed are the compositions that are identified through 
combinatorial techniques or screening techniques, or through computer modeling techniques, in which 
the compositions herein disclosed and their structural information, or portions thereof, are used as the 
10 target in a combinatorial or screening protocol, or can direct combinatorial or screening protocols. 

280. It is understood that when using the disclosed compositions in combinatorial techniques or 
screening methods, molecules, such as macromolecular molecules, will be identified that have particular 
desired properties such as inhibition or stimulation or the target molecule's function. The molecules 
identified and isolated when using the disclosed compositions, such as, any of the proteins disclosed 

15 herein, such asTSGlOl or Vps4A or Vps4B or GAG, are also disclosed. Thus, the products produced 
using the combinatorial or screening approaches that involve the disclosed compositions, such as any of 
the proteins disclosed herein, such as TSG101 or Vps4A or Vps4B or GAG, are also considered herein 
disclosed. 

281 . Combinatorial chemistry includes but is not limited to all methods for isolating small 
20 molecules or macromolecules that are capable of binding either a small molecule or another 

macromolecule, typically in an iterative process. Proteins, oligonucleotides, and sugars are examples of 
macromolecules. For example, oligonucleotide molecules with a given function, catalytic or ligand- 
binding, can be isolated from a complex mixture of random oligonucleotides in what has been referred to 
as "in vitro genetics" (Szostak, TIBS 19:89, 1992). One synthesizes a large pool of molecules bearing 

25 random and defined sequences and subjects that complex mixture, for example, approximately 1 0 15 
individual sequences in 100 jag of a 100 nucleotide RNA, to some selection and enrichment process. 
Through repeated cycles of affinity chromatography and PCR amplification of the molecules bound to 
the Hgand on the column, Ellington and Szostak (1990) estimated that 1 in 10 10 RNA molecules folded in 
such a way as to bind a small molecule dyes. DNA molecules with such ligand-binding behavior have 

30 been isolated as well (Ellington and Szostak, 1992; Bock et al, 1992). Techniques aimed at similar goals 
exist for small organic molecules, proteins, antibodies and other macromolecules known to those of skill 
in the art. Screening sets of molecules for a desired activity whether based on small organic libraries, 
oligonucleotides, or antibodies is broadly referred to as combinatorial chemistry. Combinatorial 
techniques are particularly suited for defining binding interactions between molecules and for isolating 

35 molecules that have a specific binding activity, often called aptamers when the macromolecules are 
nucleic acids. 
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282. There are a number of methods for isolating proteins which either have de novo activity or a 
modified activity. For example, phage display libraries have been used to isolate numerous peptides that 
interact with a specific target. (See for example, United States Patent No. 6,03 1 ,07 1 ; 5,824,520; 
5,596,079; and 5,565,332 which are herein incorporated by reference at least for their material related to 
phage display and methods relate to combinatorial chemistry) 

283. A preferred method for isolating proteins that have a given function is described by Roberts 
and Szostak (Roberts R.W. and Szostak J.W. Proc. Natl. Acad. Sci. USA, 94(23)12997-302 (1997). This 
combinatorial chemistry method couples the functional power of proteins and the genetic power of 
nucleic acids. An RNA molecule is generated in which a puromycin molecule is covalently attached to 
the 3'-end of the RNA molecule. An in vitro translation of this modified RNA molecule causes the 
correct protein, encoded by the RNA to be translated. In addition, because of the attachment of the 
puromycin, a peptidyl acceptor which cannot be extended, the growing peptide chain is attached to the 
puromycin which is attached to the RNA. Thus, the protein molecule is attached to the genetic material 
that encodes it. Normal in vitro selection procedures can now be done to isolate functional peptides. 
Once the selection procedure for peptide function is complete traditional nucleic acid manipulation 
procedures are performed to amplify the nucleic acid that codes for the selected functional peptides. 
After amplification of the genetic material, new RNA is transcribed with puromycin at the 3'-end, new 
peptide is translated and another functional round of selection is performed. Thus, protein selection can 
be performed in an iterative manner just like nucleic acid selection techniques. The peptide which is 
translated is controlled by the sequence of the RNA attached to the puromycin. This sequence can be 
anything from a random sequence engineered for optimum translation (i.e. no stop codons etc.) or it can 
be a degenerate sequence of a known RNA molecule to look for improved or altered function of a known 
peptide. The conditions for nucleic acid amplification and in vitro translation are well known to those of 
ordinary skill in the art and are preferably performed as in Roberts and Szostak (Roberts R.W. and 
Szostak J.W. Proc. Natl. Acad. Sci. USA, 94(23)12997-302 (1997)). 

284. Another preferred method for combinatorial methods designed to isolate peptides is 
described in Cohen et al. (Cohen B.A.,et al., Proc. Natl. Acad. Sci. USA 95(24): 14272-7 (1998)). This 
method utilizes and modifies two-hybrid technology. Yeast two-hybrid systems are useful for the 
detection and analysis of proteimprotein interactions. The two-hybrid system, initially described in the 
yeast Saccharomyces cerevisiae, is a powerful molecular genetic technique for identifying new 
regulatory molecules, specific to the protein of interest (Fields and Song, Nature 340:245-6 (1989)). 
Cohen et al., modified this technology so that novel interactions between synthetic or engineered peptide 
sequences could be identified which bind a molecule of choice. The benefit of this type of technology is 
that the selection is done in an intracellular environment. The method utilizes a library of peptide 
molecules that attached to an acidic activation domain. A peptide of choice, for example, a portion of 
TSG101 or Vps4A or Vps4B or GAG, such as the TSG101 PTAP binding domain, is attached to a DNA 
binding domain of a transcriptional activation protein, such as Gal 4. By performing the two-hybrid 
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technique on this type of system, molecules that bind the TSG101 or Vps4A or Vps4B or GAG can be 

identified. 

285. Using methodology well known to those of skill in the art, in combination with various 
combinatorial libraries, one can isolate and characterize those small molecules or macromolecules, which 

5 bind to or interact with the desired target. The relative binding affinity of these compounds can be 

compared and optimum compounds identified using competitive binding studies, which are well known 

to those of skill in the art. 

286. Techniques for making combinatorial libraries and screening combinatorial libraries to 
isolate molecules which bind a desired target are well known to those of skill in the art. Representative 

10 techniques and methods can be found in but are not limited to United States patents 5,084,824, 

5,288,514, 5,449,754, 5,506,337, 5,539,083, 5,545,568, 5,556,762, 5,565,324, 5,565,332, 5,573,905, 
5,618,825, 5,619,680, 5,627,210, 5,646,285, 5,663,046, 5,670,326, 5,677,195, 5,683,899, 5,688,696, 
5,688,997, 5,698,685, 5,712,146, 5,721,099, 5,723,598, 5,741,713, 5,792,431, 5,807,683, 5,807,754, 
5,821,130, 5,831,014, 5,834,195, 5,834,318, 5,834,588, 5,840,500, 5,847,150, 5,856,107, 5,856,496, 

15 5,859,190, 5,864,010, 5,874,443, 5,877,214, 5,880,972, 5,886,126, 5,886,127, 5,891,737, 5,916,899, 
5,919,955, 5,925,527, 5,939,268, 5,942,387, 5,945,070, 5,948,696, 5,958,702, 5,958,792, 5,962,337, 
5,965,719, 5,972,719, 5,976,894, 5,980,704, 5,985,356, 5,999,086, 6,001,579, 6,004,617, 6,008,321, 
6,017,768, 6,025,371, 6,030,917, 6,040,193, 6,045,671, 6,045,755, 6,060,596, and 6,061,636. 

287. Combinatorial libraries can be made from a wide array of molecules using a number of 
20 different synthetic techniques. For example, libraries containing fused 2,4-pyrimidinediones (United 

States patent 6,025,371) dihydrobenzopyrans (United States Patent 6,017,768and 5,821,130), amide 
alcohols (United States Patent 5,976,894), hydroxy-amino acid amides (United States Patent 5,972,719) 
carbohydrates (United States patent 5,965,719), l,4-benzodiazepin-2,5-diones (United States patent 
5,962,337), cyclics (United States patent 5,958,792), biaryl amino acid amides (United States patent 

25 5,948,696), thiophenes (United States patent 5,942,387), tricyclic Tetrahydroquinolines (United States 
patent 5,925,527), benzofurans (United States patent 5,919,955), isoquinoiines (United States patent 
5,916,899), hydantoin and thiohydantoin (United States patent 5,859,190), indoles (United States patent 
5,856,496), imidazol-pyrido-indole and imidazol-pyrido-benzothiophenes (United States patent 
5,856,107) substituted 2-methylene-2, 3-dihydrothiazoles (United States patent 5,847,150), quinolines 

30 (United States patent 5 ,840,500), PNA (United States patent 5,83 1 ,0 14), containing tags (United States 
patent 5,721,099), polyketides (United States patent 5,712,146), morpholino-subunits (United States 
patent 5,698,685 and 5,506,337), sulfamides (United States patent 5,618,825), and benzodiazepines 
(United States patent 5,288,514). 

288. As used herein combinatorial methods and libraries included traditional screening methods 

35 and libraries as well as methods and libraries used in iterative processes. 
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5. Computer readable mediums 

289. It is understood that the disclosed nucleic acids and proteins can be represented as a 
sequence consisting of the nucleotides of amino acids. There are a variety of ways to display these 
sequences, for example the nucleotide guanosine can be represented by G or g. Likewise the amino acid 

5 valine can be represented by Val or V. Those of skill in the art understand how to display and express 
any nucleic acid or protein sequence in any of the variety of ways that exist, each of which is considered 
herein disclosed. Specifically contemplated herein is the display of these sequences on computer 
readable mediums, such as, commercially available floppy disks, tapes, chips, hard drives, compact disks, 
and video disks, or other computer readable mediums. Also disclosed are the binary code representations 

10 of the disclosed sequences. Those of skill in the art understand what computer readable mediums. Thus, 
computer readable mediums on which the nucleic acids or protein sequences are recorded, stored, or 
saved. 

6. General composition information 

a) Sequence similarities 

15 290. It is understood that as discussed herein the use of the terms homology and identity mean 

the same thing as similarity. Thus, for example, if the use of the word homology is used between two 
non-natural sequences it is understood that this is not necessarily indicating an evolutionary relationship 
between these two sequences, but rather is looking at the similarity or relatedness between their nucleic 
acid sequences. Many of the methods for determining homology between two evolutionarily related 

20 molecules are routinely applied to any two or more nucleic acids or proteins for the purpose of measuring 
sequence similarity regardless of whether they are evolutionarily related or not. 

291 . In general, it is understood that one way to define any known variants and derivatives or 
those that might arise, of the disclosed genes and proteins herein, is through defining the variants and 
derivatives in terms of homology to specific known sequences. This identity of particular sequences 

25 disclosed herein is also discussed elsewhere herein. In general, variants of genes and proteins herein 
disclosed typically have at least, about 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 
87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, or 99 percent homology to the stated sequence or the native 
sequence. Those of skill in the art readily understand how to determine the homology of two proteins or 
nucleic acids, such as genes. For example, the homology can be calculated after aligning the two 

30 sequences so that the homology is at its highest level. 

292. Another way of calculating homology can be performed by published algorithms. Optimal 
alignment of sequences for comparison may be conducted by the local homology algorithm of Smith and 
Waterman Adv. Appl. Math. 2: 482 (1981), by the homology alignment algorithm of Needleman and 
Wunsch, J. MoL Biol. 48: 443 (1970), by the search for similarity method of Pearson and Lipman, Proc. 

35 Natl. Acad. Sci. U.S.A. 85: 2444 (1988), by computerized implementations of these algorithms (GAP, 
BESTFIT, FASTA, and TFASTA in the Wisconsin Genetics Software Package, Genetics Computer 
Group, 575 Science Dr., Madison, WI), or by inspection. 



RNRnnnin- <wn 



- WO 2004/039311 PCT/US2003/0 15722 

293. The same types of homology can be obtained for nucleic acids by for example the 

algorithms disclosed in Zuker, M. Science 244:48-52, 1989, Jaeger et al. Proc. Natl Acad. Set USA 
86:7706-7710, 1989, Jaeger et al. Methods Enzymol 183:281-306, 1989 which are herein incorporated by 
reference for at least material related to nucleic acid alignment. It is understood that any of the methods 
5 typically can be used and that in certain instances the results of these various methods may differ, but the 
skilled artisan understands if identity is found with at least one of these methods, the sequences would be 
said to have the stated identity, and be disclosed herein. 

294. For example, as used herein, a sequence recited as having a particular percent homology to 
another sequence refers to sequences that have the recited homology as calculated by any one or more of 

10 the calculation methods described above. For example, a first sequence has 80 percent homology, as 
defined herein, to a second sequence if the first sequence is calculated to have 80 percent homology to 
the second sequence using the Zuker calculation method even if the first sequence does not have 80 
percent homology to the second sequence as calculated by any of the other calculation methods. As 
another example, a first sequence has 80 percent homology, as defined herein, to a second sequence if the 

15 first sequence is calculated to have 80 percent homology to the second sequence using both the Zuker 
calculation method and the Pearson and Lipman calculation method even if the first sequence does not 
have 80 percent homology to the second sequence as calculated by the Smith and Waterman calculation 
method, the Needleman and Wunsch calculation method, the Jaeger calculation methods, or any of the 
other calculation methods. As yet another example, a first sequence has 80 percent homology, as defined 

20 herein, to a second sequence if the first sequence is calculated to have 80 percent homology to the second 
sequence using each of calculation methods (although, in practice, the different calculation methods will 
often^result in different calculated homology percentages). 

b) Hybridization 

295. The term hybridization typically means a sequence driven interaction between at least two 
25 nucleic acid molecules, such as a primer or a probe and a gene. Sequence driven interaction means an 

interaction that occurs between two nucleotides or nucleotide analogs or nucleotide derivatives in a 
nucleotide specific manner. For example, G interacting with C or A interacting with T are sequence 
driven interactions. Typically sequence driven interactions occur on the Watson-Crick face or Hoogsteen 
face of the nucleotide. The hybridization of two nucleic acids is affected by a number of conditions and 
30 parameters known to those of skill in the art. For example, the salt concentrations, pH, and temperature 
of the reaction all affect whether two nucleic acid molecules will hybridize. 

296. Parameters for selective hybridization between two nucleic acid molecules are well known 
to those of skill in the art. For example, in some embodiments selective hybridization conditions can be 
defined as stringent hybridization conditions. For example, stringency of hybridization is controlled by 

35 both temperature and salt concentration of either or both of the hybridization and washing steps. For 

example, the conditions of hybridization to achieve selective hybridization may involve hybridization in 
high ionic strength solution (6X SSC or 6X SSPE) at a temperature that is about 12-25°C below the Tm 
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(the melting temperature at which half of the molecules dissociate from their hybridization partners) 
followed by washing at a combination of temperature and salt concentration chosen so that the washing 
temperature is about 5°C to 20°C below the Tm. The temperature and salt conditions are readily 
determined empirically in preliminary experiments in which samples of reference DNA immobilized on 
5 filters are hybridized to a labeled nucleic acid of interest and then washed under conditions of different 
stringencies. Hybridization temperatures are typically higher for DNA-RNA and RNA-RNA 
hybridizations. The conditions can be used as described above to achieve stringency, or as is known in 
the art. (Sambrook et al., Molecular Cloning: A Laboratory Manual, 2nd Ed., Cold Spring Harbor 
Laboratory, Cold Spring Harbor, New York, 1989; Kunkel et al. Methods Enzymol. 1987:154:367, 1987 
1 0 which is herein incorporated by reference for material at least related to hybridization of nucleic acids). 
A preferable stringent hybridization condition for a DNA:DNA hybridization can be at about 68°C (in 
aqueous solution) in 6X SSC or 6X SSPE followed by washing at 68°C. Stringency of hybridization and 
washing, if desired, can be reduced accordingly as the degree of complementarity desired is decreased, 
and further, depending upon the G-C or A-T richness of any area wherein variability is searched for. 
Likewise, stringency of hybridization and washing, if desired, can be increased accordingly as homology 
desired is increased, and further, depending upon the G-C or A-T richness of any area wherein high 
homology is desired, all as known in the art. 

297. Another way to define selective hybridization is by looking at the amount (percentage) of 
one of the nucleic acids bound to the other nucleic acid. For example, in some embodiments selective 

20 hybridization conditions would be when at least about, 60, 65, 70, 71, 72, 73, 74, 75, 76, 77, 78 79 80 
81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100 percent of the limiting nucleic 
acid is bound to the non-limiting nucleic acid. Typically, the non-limiting primer is in for example, 10 or 
100 or 1000 fold excess. This type of assay can be performed at under conditions where both the limiting 
and non-limiting primer are for example, 10 fold or 100 fold or 1000 fold below their k„, or where only 

25 one' of the nucleic acid molecules is 1 0 fold or 1 00 fold or 1 000 fold or where one or both nucleic acid 
molecules are above their k<j. 

298. Another way to define selective hybridization is by looking at the percentage of primer that 
gets enzymatically manipulated under conditions where hybridization is required to promote the desired 
enzymatic manipulation. For example, in some embodiments selective hybridization conditions would 

30 be when at least about, 60, 65, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 
90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100 percent of the primer is enzymatically manipulated under 
conditions which promote the enzymatic manipulation, for example if the enzymatic manipulation is 
DNA extension, then selective hybridization conditions would be when at least about 60, 65, 70, 71, 72, 
73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100 

35 percent of the primer molecules are extended. Preferred conditions also include those suggested by the 
manufacturer or indicated in the art as being appropriate for the enzyme performing the manipulation. 
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299. Just as with homology, it is understood that there are a variety of methods herein disclosed 
for determining the level of hybridization between two nucleic acid molecules. It is understood that these 
methods and conditions may provide different percentages of hybridization between two nucleic acid 
molecules, but unless otherwise indicated meeting the parameters of any of the methods would be 

5 sufficient. For example if 80% hybridization was required and as long as hybridization occurs within the 
required parameters in any one of these methods it is considered disclosed herein. 

300. It is understood that those of skill in the art understand that if a composition or method 
meets any one of these criteria for determining hybridization either collectively or singly it is a 
composition or method that is disclosed herein. 

10 c) Nucleic acids 

301. There are a variety of molecules disclosed herein that are nucleic acid based, including for 
example die nucleic acids that encode, for example TSG101, Vps4A, or Vps4B, Bc2, Vps28, or Vps37, 
as well as various functional nucleic acids. The disclosed nucleic acids are made up of, for example, 
nucleotides, nucleotide analogs, or nucleotide substitutes. Non-limiting examples of these and other 

15 molecules are discussed herein. It is understood that for example, when a vector is expressed in a cell, 
that the expressed mRNA will typically be made up of A, C, G, and U. Likewise, it is understood that if, 
for example, an antisense molecule is introduced into a cell or cell environment through for example 
exogenous delivery, it is advantagous that the antisense molecule be made up of nucleotide analogs that 
reduce the degradation of the antisense molecule in the cellular environment. 

20 (1) Nucleotides and related molecules 

302. A nucleotide is a molecule that contains a base moiety, a sugar moiety and a phosphate 
moiety. Nucleotides can be linked together through their phosphate moieties and sugar moieties creating 
an internucleoside linkage. The base moiety of a nucleotide can be adenin-9-yl (A), cytosin-l-yl (C), 
guanin-9-yl (G), uracil-l-yl (U), and thymin-l-yl (T). The sugar moiety of a nucleotide is a ribose or a 

25 deoxyribose. The phosphate moiety of a nucleotide is pentavalent phosphate. An non-limiting example 
of a nucleotide would be 3-AMP (3-adenosine monophosphate) or 5-GMP (5'-guanosine 
monophosphate). 

303. A nucleotide analog is a nucleotide which contains some type of modification to either the 
base, sugar, or phosphate moieties. Modifications to nucleotides are well known in the art and would 

30 include for example, 5-methylcytosine (5-me-C), 5-hydroxymethyl cytosine, xanthine, hypoxanthine, and 
2-aminoadenine as well as modifications at the sugar or phosphate moieties. 

304. Nucleotide substitutes are molecules having similar functional properties to nucleotides, but 
which do not contain a phosphate moiety, such as peptide nucleic acid (PNA). Nucleotide substitutes are 
molecules that will recognize nucleic acids in a Watson-Crick or Hoogsteen manner, but which are 

35 linked together through a moiety other than a phosphate moiety. Nucleotide substitutes are able to 
conform to a double helix type structure when interacting with the appropriate target nucleic acid. 
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305. It is also possible to link other types of molecules (conjugates) to nucleotides or nucleotide 
analogs to enhance for example, cellular uptake. Conjugates can be chemically linked to the nucleotide 
or nucleotide analogs. Such conjugates include but are not limited to lipid moieties such as a cholesterol 
moiety. (Letsinger et al., Proc. Natl. Acad. Sci. USA, 1989,86, 6553-6556), 

306. A Watson-Crick interaction is at least one interaction with the Watson-Crick face of a 
nucleotide, nucleotide analog, or nucleotide substitute. The Watson-Crick face of a nucleotide, 
nucleotide analog, or nucleotide substitute includes the C2, Nl, and C6 positions of a purine based 
nucleotide, nucleotide analog, or nucleotide substitute and the C2, N3 5 C4 positions of a pyrimidine based 
nucleotide, nucleotide analog, or nucleotide substitute. 

307. A Hoogsteen interaction is the interaction that takes place on the Hoogsteen face of a 
nucleotide or nucleotide analog, which is exposed in the major groove of duplex DNA. The Hoogsteen 
face includes the N7 position and reactive groups (NH2 or O) at the C6 position of purine nucleotides. 

(2) Sequences 

308. There are a variety of sequences related to the TSG101 (See for example, Q99816, Q61 187 
NP_006283, NP_057292, NP_068684), Vps4A, or Vps4B, Bc2, Vps37, Vps28, and GAG, these 
sequences and others are herein incorporated by reference in their entireties as well as for individual 
subsequences contained therein. 

309. It is understood that the description related to the disclosed sequences is applicable to any 
sequence related to any of the proteins and genes disclosed herein, such as TSG101, Vps4A, or Vps4B or 
retroviral GAG, for example, unless specifically indicated otherwise. Those of skill in the art understand 
how to resolve sequence discrepancies and differences and to adjust the compositions and methods 
relating to a particular sequence to other related sequences (i.e. sequences of TSG101, Vps4A, or 
Vps4B). Primers and/or probes can be designed for any of the disclosed molecules, such as TSG101, 
Vps4A, or Vps4B sequence given the information disclosed herein and known in the art. 

(3) Primers and probes 

310. Disclosed are compositions including primers and probes, which are capable of interacting 
with any of the disclosed nucleic acids, such as the TSG101, Vps4A, or Vps4B genes or GAG as 
disclosed herein. In certain embodiments the primers are used to support DNA amplification reactions. 
Typically the primers will be capable of being extended in a sequence specific manner. Extension of a 
primer in a sequence specific manner includes any methods wherein the sequence and/or composition of 
the nucleic acid molecule to which the primer is hybridized or otherwise associated directs or influences 
the composition or sequence of the product produced by the extension of the primer. Extension of the 
primer in a sequence specific manner therefore includes, but is not limited to, PCR, DNA sequencing, 
DNA extension, DNA polymerization, RNA transcription, or reverse transcription. Techniques and 
conditions that amplify the primer in a sequence specific manner are preferred. In certain embodiments 
the primers are used for the DNA amplification reactions, such as PCR or direct sequencing. It is 
understood that in certain embodiments the primers can also be extended using non-enzymatic 
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techniques, where for example, the nucleotides or oligonucleotides used to extend the primer are 
modified such that they will chemically react to extend the primer in a sequence specific manner. 
Typically the disclosed primers hybridize with any of the disclosed nucleic acids, such as TSG101, 
Vps4A, or Vps4B nucleic acid or region of the TSG101, Vps4A, or Vps4B nucleic acid or they hybridize 
5 with the complement of the TSG101, Vps4A, or Vps4B nucleic acid or complement of a region of the 
TSG101, Vps4A, or Vps4B nucleic acid. 

d) Delivery of the compositions to cells 
(1) Nucleic Acid Delivery 

311. There are a number of compositions and methods which can be used to deliver nucleic acids 
10 to cells, either in vitro or in vivo. These methods and compositions can largely be broken down into two 

classes: viral based delivery systems and non-viral based delivery systems. For example, the nucleic 
acids can be delivered through a number of direct delivery systems such as, electroporation, lipofection, 
calcium phosphate precipitation, plasmids, viral vectors, viral nucleic acids, phage nucleic acids, phages, 
cosmids, or via transfer of genetic material in cells or carriers such as cationic liposomes. Appropriate 

15 means for transfection, including viral vectors, chemical transfectants, or physico-mechanical methods 
such as electroporation and direct diffusion of DNA, are described by, for example, Wolff, J. A., et al., 
Science, 247, 1465-1468, (1990); and Wolff, J. A. Nature, 352, 815-818, (1991) Such methods are well 
known in the art and readily adaptable for use with the compositions and methods described herein. In 
certain cases, the methods will be modified to specifically function with large DNA molecules. Further, 

20 these methods can be used to target certain diseases and cell populations by using the targeting 
characteristics of the carrier. 

312. In the methods described herein, which include the administration and uptake of exogenous 
DNA into the cells of a subject (i.e., gene transduction or transfection), the nucleic acids of the present 
invention can be in the form of naked DNA or RNA, or the nucleic acids can be in a vector for delivering 

25 the nucleic acids to the cells, whereby the encoding DNA or DNA or fragment is under the 

transcriptional regulation of a promoter, as would be well understood by one of ordinary skill in the art as 
well as enhancers. The vector can be a commercially available preparation, such as an adenovirus vector 
(Quantum Biotechnologies, Inc. (Laval, Quebec, Canada). 

313. As one example, vector delivery can be via a viral system, such as a retroviral vector system 
30 which can package a recombinant retroviral genome (see e.g., Pastan et al., Proc. Natl. Acad. Sci. U.S.A. 

85:4486, 1988; Miller et al., Mol Cell. BioL 6:2895, 1986). The recombinant retrovirus can then be used 
to infect and thereby deliver to the infected cells nucleic acid encoding a broadly neutralizing antibody 
(or active fragment thereof) of the invention. The exact method of introducing the altered nucleic acid 
into mammalian cells is, of course, not limited to the use of retroviral vectors. Other techniques are 
35 widely available for this procedure including the use of adenoviral vectors (Mitani et al., Ham. Gene 
Titer. 5:941-948, 1994), adeno-associated viral (AAV) vectors (Goodman et al., Blood 84:1492-1500, 
1994), lentiviral vectors (Naidini et al., Science 272:263-267, 1996), pseudotyped retroviral vectors 
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(Agrawal et al., Exper. HematoL 24:738-747, 1996). Physical transduction techniques can also be used, 

such as liposome delivery and receptor-mediated and other endocytosis mechanisms (see, for example, 

Schwartzenberger et al., Blood 87:472-478, 1996). This invention can be used in conjunction with any of 

these or other commonly used gene transfer methods. 

5 3 14. As one example, if the antibody-encoding nucleic acid or some other nucleic acid encoding 

an inhibitor of the TSG101, Vps4A, or Vps4B-GAG interactions of the invention is delivered to the cells 

of a subject in an adenovirus vector, the dosage for administration of adenovirus to humans can range 

from about 10 7 to 10 9 plaque forming units (pfu) per injection but can be as high as 10 ,z pfu per injection 

(Crystal, Hum. Gene Then 8:985-1001, 1997; Alvarez and Curiel, /ftuit. Gene Ther. 8:597-613, 1997). A 

10 subject can receive a single injection, or, if additional injections are necessary, they can be repeated at six 

month intervals (or other appropriate time intervals, as determined by the skilled practitioner) for an 

indefinite period and/or until the efficacy of the treatment has been established. 

315. Parenteral administration of the nucleic acid or vector of the present invention, if used, is 
generally characterized by injection. Injectables can be prepared in conventional forms, either as liquid 

15 solutions or suspensions, solid forms suitable for solution of suspension in liquid prior to injection, or as 
emulsions. A more recently revised approach for parenteral administration involves use of a slow release 
or sustained release system such that a constant dosage is maintained. See, e.g., U.S. Patent No. 
3,610,795, which is incorporated by reference herein. For additional discussion of suitable formulations 
and various routes of administration of therapeutic compounds, see, e.g., Remington: The Science and 

20 Practice of Pharmacy (19th ed.) ed. A.R. Gennaro, Mack Publishing Company, Easton, PA 1995. 

316. Nucleic acids that are delivered to cells which are to be integrated into the host cell genome, 
typically contain integration sequences. These sequences are often viral related sequences, particularly 
when viral based systems are used. These viral integration systems can also be incorporated into nucleic 
acids which are to be delivered using a non-nucleic acid based system of deliver, such as a liposome, so 

25 that the nucleic acid contained in the delivery system can be come integrated into the host genome. 

317. Other general techniques for integration into the host genome include, for example, systems 
designed to promote homologous recombination with the host genome. These systems typically rely on 
sequence flanking the nucleic acid to be expressed that has enough homology with a target sequence 
within the host cell genome that recombination between the vector nucleic acid and the target nucleic 

30 acid takes place, causing the delivered nucleic acid to be integrated into the host genome. These systems 
and the methods necessary to promote homologous recombination are known to those of skill in the art. 

(2) Non-nucleic acid based systems 

318. The disclosed compositions can be delivered to the target cells in a variety of ways. For 
example, the compositions can be delivered through electroporation, or through lipofection, or through 

35 calcium phosphate precipitation. The delivery mechanism chosen will depend in part on the type of cell 
targeted and whether the delivery is occurring for example in vivo or in vitro. 
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319. Thus, the compositions can comprise, in addition to the disclosed compositions or vectors 
for example, lipids such as liposomes, such as cationic liposomes (e.g., DOTMA, DOPE, 
DC-cholesterol) or anionic liposomes. Liposomes can further comprise proteins to facilitate targeting a 
particular cell, if desired. Administration of a composition comprising a compound and a cationic 

5 liposome can be administered to the blood afferent to a target organ or inhaled into the respiratory tract to 
target cells of the respiratory tract. Regarding liposomes, see, e.g., Brigham et al. Am. J. Jtesp. Cell. Mol. 
Biol. 1:95-100 (1989); Feigner et al. Proc. Natl. Acad. Sci USA 84:7413-7417 (1987); U.S. Pat. 
No.4,897,355. Furthermore, the compound can be administered as a component of a microcapsule that 
can be targeted to specific cell types, such as macrophages, or where the diffusion of the compound or 
10 delivery of the compound from the microcapsule is designed for a specific rate or dosage. 

320. In the methods described above which include the administration and uptake of exogenous 
DNA into the cells of a subject (i.e., gene transduction or transfection), delivery of the compositions to 
cells can be via a variety of mechanisms. As one example, delivery can be via a liposome, using 
commercially available liposome preparations such as LIPOFECTIN, LIPOFECT AMINE (GIBCO-BRL, 

15 Inc., Gaithersburg, MD), SUPERFECT (Qiagen, Inc. Hilden, Germany) and TRANSFECTAM (Promega 
Biotec, Inc., Madison, WI), as well as other liposomes developed according to procedures standard in the 
art. In addition, the nucleic acid or vector of this invention can be delivered in vivo by electroporatipn, 
the technology for which is available from Genetronics, Inc. (San Diego, CA) as well as by means of a 
SONOPORATION machine (ImaRx Pharmaceutical Corp., Tucson, AZ). 

20 321. The materials may be in solution, suspension (for example, incorporated into microparticles, 

liposomes, or cells). These may be targeted to a particular cell type via antibodies, receptors, or receptor 
ligands. The following references are examples of the. use of this technology to target specific proteins to 
tumor tissue (Senter, et al., Bioconiugate Chem. , 2:447-451, (1991); Bagshawe, K.D., Br. J. Cancer. 
60:275-281, (1989); Bagshawe, et al., Br. J. Cancer . 58:700-703, (1988); Senter, et al., Bioconiugate 

25 Cheiru 4:3-9, (1993); Battelli, et al., Cancer Immunol. Immunother. . 35:421-425, (1992); Pietersz and 
McKenzie, Immunolog. Reviews , 129:57-80, (1992); and Roffler, et al, Biochem. Pharmacol . 
42:2062-2065, (1991)). These techniques can be used for a variety of other specific cell types. Vehicles 
such as "stealth" and other antibody conjugated liposomes (including lipid mediated drug targeting to 
colonic carcinoma), receptor mediated targeting of DNA through cell specific ligands, lymphocyte 

30 directed tumor targeting, and highly specific therapeutic retroviral targeting of murine glioma cells in 
vivo. The following references are examples of the use of this technology to target specific proteins to 
tumor tissue (Hughes et al., Cancer Research . 49:6214-6220, (1989); and Litzinger and Huang, 
Biochiniic a et Biop hvsica Acta. 1 104: 179-187, (1992)). In general, receptors are involved in pathways 
of endocytosis, either constitutive or ligand induced. These receptors cluster in clathrin-coated pits, enter 

35 the cell via clathrin-coated vesicles, pass through an acidified endosome in which the receptors are 
sorted, and then either recycle to the cell surface, become stored intracellular!^ or are degraded in 
lysosomes. The internalization pathways serve a variety of functions, such as nutrient uptake, removal of 
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activated proteins, clearance of rnacromolecules, opportunistic entry of viruses and toxins, dissociation 

and degradation of ligand, and receptor-level regulation, Many receptors follow more than one 

intracellular pathway, depending on the cell type, receptor concentration, type of ligand, ligand valency, 

and ligand concentration. Molecular and cellular mechanisms of receptor-mediated endocytosis has been 

5 reviewed (Brown and Greene, DNA and Cell Biology 10:6, 399-409 (1991)). 

(3) In vivo/ex vivo 

322. As described above, the compositions can be administered in a pharmaceutically acceptable 
carrier and can be delivered to the subjects cells in vivo and/or ex vivo by a variety of mechanisms well 
known in the art (e.g., uptake of naked DNA, liposome fusion, intramuscular injection of DNA via a gene 

10 gun, endocytosis and the like). 

323. If ex vivo methods are employed, cells or tissues can be removed and maintained outside the 
body according to standard protocols well known in the art. The compositions can be introduced into the 
cells via any gene transfer mechanism, such as, for example, calcium phosphate mediated gene delivery, 
electroporation, microinjection or proteoliposomes. The transduced cells can then be infused (e.g., in a 

15 pharmaceutically acceptable carrier) or homotopically transplanted back into the subject per standard 
methods for the cell or tissue type. Standard methods are known for transplantation or infusion of 
various cells into a subject. 

e) Expression systems 

324. The nucleic acids that are delivered to cells typically contain expression controlling systems. 
20 For example, the inserted genes in viral and retroviral systems usually contain promoters, and/or 

enhancers to help control the expression of the desired gene product. A promoter is generally a sequence 
or sequences of DNA that function when in a relatively fixed location in regard to the transcription start 
site. A promoter contains core elements required for basic interaction of RNA polymerase and 
transcription factors, and may contain upstream elements and response elements. 
25 (1) Viral Promoters and Enhancers 

325. Preferred promoters controlling transcription from vectors in mammalian host cells may be 
obtained from various sources, for example, the genomes of viruses such as: polyoma, Simian Virus 40 
(SV40), adenovirus, retroviruses, hepatitis-B virus and most preferably cytomegalovirus, or from 
heterologous mammalian promoters, e.g. beta actin promoter. The early and late promoters of the SV40 

30 virus are conveniently obtained as an SV40 restriction fragment which also contains the SV40 viral 

origin of replication (Fiers et al., Nature . 273: 1 13 (1978)). The immediate early promoter of the human 
cytomegalovirus is conveniently obtained as a Hin dlll E restriction fragment (Greenway, PJ. et al., 
Gene 18: 355-360 (1982)). Of course, promoters from the host cell or related species also are useful 
herein. 

35 326. Enhancer generally refers to a sequence of DNA that functions at no fixed distance from the 

transcription start site and can be either 5' (Laimins, L. et al., Proc. Natl Acad. Sci. 78: 993 (1981)) or 3' 

(Lusky, M.L., et al., Mol. Cell Bio. 3: 1 108 (1983)) to the transcription unit. Furthermore, enhancers can 

be within an intron (Banerji, J.L. et al., Cell 33: 729 (1983)) as well as within the coding sequence itself 
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(Osborne, T.F., et al. s Mol. Cell Bio. 4: 1293 (1984)). They are usually between 10 and 300 bp in length, 

and they function in cis. Enhancers f unction to increase transcription from nearby promoters. 

Enhancers also often contain response elements that mediate the regulation of transcription. Promoters 

can also contain response elements that mediate the regulation of transcription. Enhancers often 

5 determine the regulation of expression of a gene. While many enhancer sequences are now known from 

mammalian genes (globin, elastase, albumin, -fetoprotein and insulin), typically one will use an 

enhancer from a eukaryotic cell virus for general expression. Examples are the S V40 enhancer on the 

late side of the replication origin (bp 100-270), the cytomegalovirus early promoter enhancer, the 

polyoma enhancer on the late side of the replication origin, and adenovirus enhancers. 

10 327. The promoter and/or enhancer may be specifically activated either by light or specific 

chemical events which trigger their function. Systems can be regulated by reagents such as tetracycline 
and dexamethasone. There are also ways to enhance viral vector gene expression by exposure to 
irradiation, such as gamma irradiation, or alkylating chemotherapy drugs. 

328. In certain embodiments the promoter and/or enhancer region can act as a constitutive 

15 promoter and/or enhancer to maximize expression of the region of the transcription unit to be transcribed. 
In certain constructs the promoter and/or enhancer region be active in all eukaryotic cell types, even if it 
is only expressed in a particular type of cell at a particular time. A preferred promoter of this type is the 
CMV promoter (650 bases). Other preferred promoters are SV40 promoters, cytomegalovirus (full 
length promoter), and retroviral vector LTF. 

20 329. It has been shown that all specific regulatory elements can be cloned and used to construct 

expression vectors that are selectively expressed in specific cell types such as melanoma cells. The glial 
fibrillary acetic protein (GFAP) promoter has been used to selectively express genes in cells of glial 
origin. 

330. Expression vectors used in eukaryotic host cells (yeast, fungi, insect, plant, animal, human 
25 or nucleated cells) may also contain sequences necessary for the termination of transcription which may 
affect mRNA expression. These regions are transcribed as polyadenylated segments in the untranslated 
portion of the mRNA encoding tissue factor protein. The 3' untranslated regions also include 
transcription termination sites. It is preferred that the transcription unit also contain a polyadenylation 
region. One benefit of this region is that it increases the likelihood that the transcribed unit will be 
30 processed and transported like mRNA. The identification and use of polyadenylation signals in 

expression constructs is well established. It is preferred that homologous polyadenylation signals be used 
in the transgene constructs. In certain transcription units, the polyadenylation region is derived from the 
SV40 early polyadenylation signal and consists of about 400 bases. It is also preferred that the 
transcribed units contain other standard sequences alone or in combination with the above sequences 
35 improve expression from, or stability of, the construct. 
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(2) Markers 

33 1 . The vectors can include nucleic acid sequence encoding a marker product. This marker 
product is used to determine if the gene has been delivered to the cell and once delivered is being 
expressed. Preferred marker genes are the E. Coli lacZ gene, which encodes B-galactosidase, and green 

5 fluorescent protein. 

332. In some embodiments the marker may be a selectable marker. Examples of suitable 
selectable markers for mammalian cells are dihydrofolate reductase (DHFR), thymidine kinase, 
neomycin, neomycin analog G418, hydromycin, and puromycin. When such selectable markers are 
successfully transferred into a mammalian host cell, the transformed mammalian host cell can survive if 

10 placed under selective pressure. There are two widely used distinct categories of selective regimes. The 
first category is based on a cell's metabolism and the use of a mutant cell line which lacks the ability to 
grow independent of a supplemented media. Two examples are: CHO DHFR- cells and mouse LTK- 
cells. These cells lack the ability to grow without the addition of such nutrients as thymidine or 
hypoxanthine. Because these cells lack certain genes necessary for a complete nucleotide synthesis 

15 pathway, they cannot survive unless the missing nucleotides are provided in a supplemented media. An 
alternative to supplementing the media is to introduce an intact DHFR or TK gene into cells lacking the 
respective genes, thus altering their growth requirements. Individual cells which were not transformed 
with the DHFR or TK gene will not be capable of survival in non-supplemented media. 

333. The second category is dominant selection which refers to a selection scheme used in any 
20 cell type and does not require the use of a mutant cell line. These schemes typically use a drug to arrest 

growth of a host cell. Those cells which have a novel gene would express a protein conveying drug 
resistance and would survive the selection. Examples of such dominant selection use the drugs 
neomycin, (Southern P. and Berg, P., J. Molec. Appl. Genet. 1: 327 (1982)), mycophenolic acid, 
(Mulligan, R.C. and Berg, P. Science 209: 1422 (1980)) or hygromycin, (Sugden, B. et al., Mol. Cell. 
25 Biol. 5: 410-413 (1985)). The three examples employ bacterial genes under eukaryotic control to 

convey resistance to the appropriate drug G418 or neomycin (geneticin), xgpt (mycophenolic acid) or 
hygromycin, respectively. Others include the neomycin analog G418 and puramycin. 

£) Peptides 

(1) Protein variants 

30 334. As discussed herein there are numerous variants of the TSG101, Vps4A, or Vps4B proteins 

and GAG protein that are known and herein contemplated. In addition, to the known functional TSG101, 
Vps4A, or Vps4B strains and GAG strain there are derivatives and variants of the TSG101, Vps4A, or 
Vps4B and GAG proteins which also function in the disclosed methods and compositions. Protein 
variants and derivatives are well understood to those of skill in the art and in can involve amino acid 

35 sequence modifications. For example, amino acid sequence modifications typically fall into one or more 

of three classes: substitutional, insertional or deletional variants. Insertions include amino and/or 

carboxyl terminal fusions as well as intrasequence insertions of single or multiple amino acid residues. 

Insertions ordinarily will be smaller insertions than those of amino or carboxyl terminal fusions, for 
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example, on the order of one to four residues. Immunogenic fusion protein derivatives, such as those 
described in the examples, are made by fusing a polypeptide sufficiently large to confer immunogenicity 
to the target sequence by cross-linking in vitro or by recombinant cell culture transformed with DNA 
encoding the fusion. Deletions are characterized by the removal of one or more amino acid residues from 
5 the protein sequence. Typically, no more than about from 2 to 6 residues are deleted at any one site 
within the protein molecule. These variants ordinarily are prepared by site specific mutagenesis of 
nucleotides in the DNA encoding the protein, thereby producing DNA encoding the variant, and 
thereafter expressing the DNA in recombinant cell culture. Techniques for making substitution 
mutations at predetermined sites in DNA having a known sequence are well known, for example M13 
10 primer mutagenesis and PCR mutagenesis. Amino acid substitutions are typically of single residues, but 
can occur at a number of different locations at once; insertions usually will be on the order of about from 
1 to 10 amino acid residues; and deletions will range about from 1 to 30 residues. Deletions or insertions 
preferably are made in adjacent pairs, i.e. a deletion of 2 residues or insertion of 2 residues. 
Substitutions, deletions, insertions or any combination thereof may be combined to arrive at a final 
1 5 construct. The mutations must not place the sequence out of reading frame and preferably will not create 
complementary regions that could produce secondary mRNA structure. Substitutional variants are those 
in which at least one residue has been removed and a different residue inserted in its place. Such 
substitutions generally are made in accordance with the following Tables 1 and 2 and are referred to as 
conservative substitutions. 
20 335. TABLE 1 : Amino Acid Abbreviations 



Amino Acid 


Abbreviations 


Alanine 


AlaA 


Allosoleucine 


Alle 


Arginine 


AreR 


Asparagines 


AsnN 


aspartic acid 


AspD 


Cysteine 


CysC _j 


glutamic acid 


GluE 


Glutaniine 


GlnQ 


Glycine 


GlyG 


Histidine 


HisH 


Isolelucine 


Ilel 


Leucine 


LeuL 


Lysine 


LysK 


Phenylalanine 


PheF 


Proline 


ProP 


pyroglutamic acidp 


Glu 


Serine 


SerS 


Threonine 


ThrT 


Tyrosine 


TyrY 


Tryptophan 


TrpW 


Valine 


^ ValV 



Table 2. 

Original Residue Exemplary Conservative Substitutions, others are known in the art. 
Ala ser 
Arg lvs > S ln 
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10 



15 



Asn 


nrltv hie 
gill, lllo 


Asp 




Cys 


ser 


Gin 


asn, lys 


Glu 


asp 


Gly 


pro 


His 


asn; gin 


He 


leu; val 


Leu 


ile; val 


Lys 


arg; gin; 


Met 


Leu; ile 


Phe 


met; leu; tyr 


Ser 


thr 


Thr 


ser 


Trp 


tyr 


Tyr 


trp; phe 


Val 


ile; leu 



336. Substantial changes in function or immunological identity are made by selecting 
20 substitutions that are less conservative than those in Table 2, i.e., selecting residues that differ more 

significantly in their effect on maintaining (a) the structure of the polypeptide backbone in the area of the 
substitution, for example as a sheet or helical conformation, (b) the charge or hydrophobicity of the 
molecule at the target site or (c) the bulk of the side chain. The substitutions which in general are 
expected to produce the greatest changes in the protein properties will be those in which (a) a hydrophilic 
25 residue, e.g. seryl or threonyl, is substituted for (or by) a hydrophobic residue, e.g. leucyl, isoleucyl, 
phenylalanyl, valyl or alanyl; (b) a cysteine or proline is substituted for (or by) any other residue; (c) a 
residue having an electropositive side chain, e.g., lysyl, arginyl, or histidyl, is substituted for (or by) an 
electronegative residue, e.g., glutamyl or aspartyl; or (d) a residue having a bulky side .chain, e.g., 
phenylalanine, is substituted for (or by) one not having a side chain, e.g., glycine, in this case, (e) by 
30 increasing the number of sites for sulfation and/or glycosylation. 

337. For example, the replacement of one amino acid residue with another that is biologically 
and/or chemically similar is known to those skilled in the art as a conservative substitution. For example, 
a conservative substitution would be replacing one hydrophobic residue for another, or one polar residue 
for another. The substitutions include combinations such as, for example, Gly, Ala; Val, Ile, Leu; Asp, 

35 Glu; Asn, Gin; Ser, Thr; Lys, Arg; and Phe, Tyr. Such conservatively substituted variations of each 
explicitly disclosed sequence are included within the mosaic polypeptides provided herein. 

338. Substitutional or deletional mutagenesis can be employed to insert sites for N-glycosylation 
(Asn-X-Thr/Ser) or O-glycosylation (Ser or Thr). Deletions of cysteine or other labile residues also may 
be desirable. Deletions or substitutions of potential proteolysis sites, e.g. Arg, is accomplished for 

40 example by deleting one of the basic residues or substituting one by glutaminyl or histidyl residues. 

339. Certain post-translational derivatizations are the result of the action of recombinant host 
cells on the expressed polypeptide. Glutaminyl and asparaginyl residues are frequently post- 
translationally deamidated to the corresponding glutamyl and asparyl residues. Alternatively, these 
residues are deamidated under mildly acidic conditions. Other post-translational modifications include 
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hydroxylation of proline and lysine, phosphorylation of hydroxyl groups of seryl or threonyl residues, 
methylation of the o-amino groups of lysine, arginine, and histidine side chains (T.E. Creighton, Proteins: 
Structure and Molecular Properties, W. H. Freeman & Co., San Francisco pp 79-86 [1983]), acetylation 
of the N-terminal amine and, in some instances, amidation of the C-terminal carboxyl. 
5 340. It is understood that one way to define the variants and derivatives of the disclosed proteins 

herein is through defining the variants and derivatives in terms of homology/identity to specific known 
sequences. For example, SEQ ID NO:l sets forth a particular sequence of TSG101 and SEQ ID NO:5 
sets forth a particular sequence of a retroviral GAG protein. Specifically disclosed are variants of these 
and other proteins herein disclosed which have at least, 70% or 75% or 80% or 85% or 90% or 95% 

10 homology to the stated sequence. Those of skill in the art readily understand how to determine the 
homology of two proteins. For example, the homology can be calculated after aligning the two 
sequences so that the homology is at its highest level. 

341. Another way of calculating homology can be performed by published algorithms. Optimal 
alignment of sequences for comparison may be conducted by the local homology algorithm of Smith and 

15 Waterman Adv. Appl. Math. 2: 482 (1981), by the homology alignment algorithm of Needleman and 

Wunsch, J. MoL Biol. 48: 443 (1970), by the search for similarity method of Pearson and Lipman, Proc. 
Natl. Acad. Sci. U.S.A. 85: 2444 (1988), by computerized implementations of these algorithms (GAP, 
BESTFIT, FASTA, and TFASTA in the Wisconsin Genetics Software Package, Genetics Computer 
Group, 575 Science Dr., Madison, WI), or by inspection. 

20 342. The same types of homology can be obtained for nucleic acids by for example the 

algorithms disclosed in Zuker, M. Science 244:48-52, 1989, Jaeger et al. Proc. Natl Acad. ScL USA 
86:7706-7710, 1989, Jaeger et al. Methods Enzymol 183:281-306, 1989 which are herein incorporated by 
reference for at least material related to nucleic acid alignment. 

343. It is understood that the description of conservative mutations and homology can be 
25 combined together in any combination, such as embodiments that have at least 70% homology to a 

particular sequence wherein the variants are conservative mutations. 

344. As this specification discusses various proteins and protein sequences it is understood that 
the nucleic acids that can encode those protein sequences are also disclosed. This would include all 
degenerate sequences related to a specific protein sequence, i.e. all nucleic acids having a sequence that 

30 encodes one particular protein sequence as well as all nucleic acids, including degenerate nucleic acids, 
encoding the disclosed variants and derivatives of the protein sequences. Thus, while each particular 
nucleic acid sequence may not be written out herein, it is understood that each and every sequence is in 
fact disclosed and described herein through the disclosed protein sequence. It is also understood that 
while no amino acid sequence indicates what particular DNA sequence encodes that protein within an 

35 organism, where particular variants of a disclosed protein are disclosed herein, the known nucleic acid 

sequence that encodes that protein in the particular organism from which that protein arises is also known 
and herein disclosed and described. 
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g) Pharmaceutical carriers/Delivery of pharamceutical products 

345 . As described herein, the compositions can also be administered in vivo in a 
pharmaceutical^ acceptable carrier. By "pharmaceutically acceptable" is meant a material that is not 
biologically or otherwise undesirable, i.e., the material may be administered to a subject, along with the 

5 nucleic acid or vector, without causing any undesirable biological effects or interacting in a deleterious 
manner with any of the other components of the pharmaceutical composition in which it is contained. 
The carrier would naturally be selected to minimize any degradation of the active ingredient and to 
minimize any adverse side effects in the subject, as would be well known to one of skill in the art. 

346. The compositions may be administered orally, parenterally (e.g., intravenously), by 

10 intramuscular injection, by intraperitoneal injection, transdermally, extracorporeally, topically or the like, 
including topical intranasal administration or administration by inhalant. As used herein, "topical 
intranasal administration" means delivery of the compositions into the nose and nasal passages through 
one or both of the nares and can comprise delivery by a spraying mechanism or droplet mechanism, or 
through aerosolization of the composition. Administration of the compositions by inhalant can be 

1 5 through the nose or mouth via delivery by a spraying or droplet mechanism. Delivery can also be 
directly to any area of the respiratory system (e.g., lungs) via intubation. The exact amount of the 
compositions required will vary from subject to subject, depending on the species, age, weight and 
general condition of the subject, the severity of the allergic disorder being treated, the particular nucleic 
acid or vector used, its mode of administration and the like. Thus, it is not possible to specify an exact 

20 amount for every composition. However, an appropriate amount can be determined by one of ordinary 
skill in the art using only routine experimentation given the teachings herein. 

347. Parenteral administration of the composition, if used, is generally characterized by injection. 
Injectables can be prepared in conventional forms, either as liquid solutions or suspensions, solid forms 
suitable for solution of suspension in liquid prior to injection, or as emulsions. A more recently revised 

25 approach for parenteral administration involves use of a slow release or sustained release system such 
that a constant dosage is maintained. See, e.g., U.S. Patent No. 3,610,795, which is incorporated by 
reference herein. 

348. The materials may be in solution, suspension (for example, incorporated into microparticles, 
liposomes, or cells). These may be targeted to a particular cell type via antibodies, receptors, or receptor 

30 ligands. The following references are examples of the use of this technology to target specific proteins to 
tumor tissue (Senter, et al., Bjojxffljugajg Chem., 2:447-451, (1991); Bagshawe, K.D., Br. J. Cancer , 
60:275-281, (1989); Bagshawe, et al., Br. J. Cancer , 58:700-703, (1988); Senter, et al., Bioconjugate 
Chem. , 4:3-9, (1993); Battelli, et al., Harder Immunol. Immunother.. 35:421-425, (1992); Pietersz and 
MrKMrie. Immunolog. Reviews . 129:57-80, (1992); and Roffler, et al., Biochem. Pharmacol , 42:2062- 

35 2065, (1991)). Vehicles such as "stealth" and other antibody conjugated liposomes (including lipid 

mediated drug targeting to colonic carcinoma), receptor mediated targeting of DNA through cell specific 
ligands, lymphocyte directed tumor targeting, and highly specific therapeutic retroviral targeting of 
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murine glioma cells hi vivo. The following references are examples of the use of this technology to target 
specific proteins to tumor tissue (Hughes et al., Cancer Research, 49:6214-6220, (1989); and Litzinger 
and Huang, Rinchimica et Bioohvsica Acta , 1 104:179-187, (1992)). In general, receptors are involved in 
pathways of endocytosis, either constitutive or ligand induced. These receptors cluster in clathrin-coated 
pits, enter the cell via clathrin-coated vesicles, pass through an acidified endosome in which the receptors 
are sorted, and then either recycle to the cell surface, become stored intracellular^, or are degraded in 
lysosomes. The internalization pathways serve a variety of functions, such as nutrient uptake, removal of 
activated proteins, clearance of macromolecules, opportunistic entry of viruses and toxins, dissociation 
and degradation of ligand, and receptor-level regulation. Many receptors follow more than one 
intracellular pathway, depending on the cell type, receptor concentration, type of ligand, ligand valency, 
and ligand concentration. Molecular and cellular mechanisms of receptor-mediated endocytosis has been 
reviewed (Brown and Greene, r»NA and Cell Biology 10:6, 399-409 (1991)). 

(1) Pharmaceutically Acceptable Carriers 

349. The compositions, including antibodies or other compositions which interact with any of the 
disclosed compositions, such as TSG101, Vps4A, or Vps4B such that TSG101, Vps4A, or Vps4B cannot 
interact with the retroviral GAG and/or inhibit retroviral budding, can be used therapeutically in 
combination with a pharmaceutically acceptable carrier. 

350. Suitable carriers and their formulations are described in Remington: The Science and 
Practice of Pharmacy (19th ed.) ed. A.R. Gennaro, Mack Publishing Company, Easton, PA 1995. 
Typically, an appropriate amount of a pharmaceutically-acceptable salt is used in the formulation to 
render the formulation isotonic. Examples of the pharmaceutically-acceptable carrier include, but are not 
limited to, saline, Ringer's solution and dextrose solution. The pH of the solution is preferably from 
about 5 to about 8, and more preferably from about 7 to about 7.5. Further carriers include sustained 
release preparations such as semipermeable matrices of solid hydrophobic polymers containing the 
antibody, which matrices are in the form of shaped articles, e.g., films, liposomes or microparticles. It 
will be apparent to those persons skilled in the art that certain carriers may be more preferable depending 
upon, for instance, the route of administration and concentration of composition being administered. 

351. Pharmaceutical carriers are known to those skilled in the art. These most typically would be 
standard carriers for administration of drugs to humans, including solutions such as sterile water, saline, 
and buffered solutions at physiological pH. The compositions can be administered intramuscularly or 
subcutaneously. Other compounds will be administered according to standard procedures used by those 
skilled in the art. 

352. Pharmaceutical compositions may include carriers, thickeners, diluents, buffers, preservatives, 
surface active agents and the like in addition to the molecule of choice. Pharmaceutical compositions may 
also include one or more active ingredients such as antimicrobial agents, antiinflammatory agents, 
anesthetics, and the like. 
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353. The pharmaceutical composition may be administered in a number of ways depending on 
whether local or systemic treatment is desired, and on the area to be treated. Administration may be topically 
(including ophthalmically, vaginally, rectally, intranasally), orally, by inhalation, or parenterally, for example 
by intravenous drip, subcutaneous, intraperitoneal or intramuscular injection. The disclosed antibodies can 

5 be administered intravenously, intraperitoneal^, intramuscularly, subcutaneously, intracavity, or 
transdermally. 

354. Preparations for parenteral administration include sterile aqueous or non-aqueous solutions, 
suspensions, and emulsions. Examples of non-aqueous solvents are propylene glycol, polyethylene 
glycol, vegetable oils such as olive oil, and injectable organic esters such as ethyl oleate. Aqueous 

10 carriers include water, alcoholic/aqueous solutions, emulsions or suspensions, including saline and 

buffered media. Parenteral vehicles include sodium chloride solution, Ringer's dextrose, dextrose and 
sodium chloride, lactated Ringers, or fixed oils. Intravenous vehicles include fluid and nutrient 
replenishes, electrolyte replenishes (such as those based on Ringer's dextrose), and the like. 
Preservatives and other additives may also be present such as, for example, antimicrobials, anti-oxidants, 

15 chelating agents, and inert gases and the like. 

355. Formulations for topical administration may include ointments, lotions, creams, gels, drops, 
suppositories, sprays, liquids and powders. Conventional pharmaceutical carriers, aqueous, powder or oily 
bases, thickeners and the like may be necessary or desirable. 

356. Compositions for oral administration include powders or granules, suspensions or solutions in 
20 water or non-aqueous media, capsules, sachets, or tablets. Thickeners, flavorings, diluents, emulsifiers, 

dispersing aids or binders may be desirable. 

357. Some of the compositions may potentially be administered as a pharrnaceuticaily acceptable 
acid- or base- addition salt, formed by reaction with inorganic acids such as hydrochloric acid, 
hydrobromic acid, perchloric acid, nitric acid, thiocyanic acid, sulfuric acid, and phosphoric acid, and 

25 organic acids such as formic acid, acetic acid, propionic acid, glycolic acid, lactic acid, pyruvic acid, 

oxalic acid, malonic acid, succinic acid, maleic acid, and ftimaric acid, or by reaction with an inorganic 
base such as sodium hydroxide, ammonium hydroxide, potassium hydroxide, and organic bases such as 
mono-, di-, trialkyl and aryl amines and substituted ethanolamines. 

(2) Therapeutic Uses 

30 358. Effective dosages and schedules for administering die compositions may be determined 

empirically, and making such determinations is within the skill in the art. The dosage ranges for the 
administration of the compositions are those large enough to produce the desired effect in which the 
symptoms disorder are effected. The dosage should not be so large as to cause adverse side effects, such 
as unwanted cross-reactions, anaphylactic reactions, and the like. Generally, the dosage will vary with 

35 the age, condition, sex and extent of the disease in the patient, route of administration, or whether other 
drugs are included in the regimen, and can be determined by one of skill in the art. The dosage can be 
adjusted by the individual physician in the event of any counterindications. Dosage can vary, and can be 
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administered in one or more dose administrations daily, for one or several days. Guidance can be found 
in the literature for appropriate dosages for given classes of pharmaceutical products. For example, 
guidance in selecting appropriate doses for antibodies can be found in the literature on therapeutic uses of 
antibodies, e.g., Handbook of Monoclonal Antibodies, Ferrone et aL, eds., Noges Publications, Park 
5 Ridge, N.J., (1985) ch. 22 and pp. 303-357; Smith et aL, Antibodies in Human Diagnosis and Therapy, 
Haber et aL, eds., Raven Press, New York (1977) pp. 365-389. A typical daily dosage of the antibody 
used alone might range from about 1 ng/kg to up' to 100 mg/kg of body weight or more per day, 
depending on the factors mentioned above. 

359. Following administration of a disclosed composition, such as an antibody or molecule that 
10 inhibits, for example, TSG101, Vps4A, or Vps4B from interacting with GAG, for treating, inhibiting, or 

preventing an HIV infection, the efficacy of the therapeutic composition can be assessed in various ways 
well known to the skilled practitioner. For instance, one of ordinary skill in the art will understand that 
the compositions disclosed herein are efficacious in treating or inhibiting an HIV infection in a subject by 
observing that the composition reduces viral load or prevents a further increase in viral load. Viral loads 

15 can be measured by methods that are known in the art, for example, using polymerase chain reaction 

assays to detect the presence of HIV nucleic acid or antibody assays to detect the presence of HIV protein 
in a sample (e.g., but not limited to, blood) from a subject or patient, or by measuring the level of 
circulating anti-HIV antibody levels in the patient. Efficacy of the administration of the disclosed 
composition may also be determined by measuring the number of CD4 + T cells in the HIV-infected 

20 subject. An antibody treatment that inhibits an initial or further decrease in CD4 + T cells in an HIV- 
positive subject or patient, or that result in an increase in the number of CD4 + T cells in the HIV-positive 
subject, is an efficacious antibody treatment. 

360. The compositions that inhibit, for example, TSG101-PTAP interactions or TSG101 , Vps4A 
or Vps4B interactions disclosed herein may be administered prophylactically to patients or subjects who 

25 are at risk for being exposed to HIV or who have been newly exposed to HIV. In subjects who have been 
newly exposed to HIV but who have not yet displayed the presence of the virus (as measured by PCR or 
other assays for detecting the virus) in blood or other body fluid, efficacious treatment with an antibody 
of the invention partially or completely inhibits the appearance of the virus in the blood or other body 
fluid. 

30 361. Other molecules that interact with, for example, TSG101, Vps4A, or Vps4B to inhibit 

TSG101, Vps4A or Vps4B interactions which do not have a specific pharmaceutical function, but which 
may be used for tracking changes within cells or for the delivery of diagnostic tools for example can be 
delivered in ways similar to those described for the pharmaceutical products. 

362. The disclosed compositions and methods can also be used for example as tools to isolate 

35 and test new drug candidates for a variety of PTAP motif containing retroviral related diseases. They can 
also be used for the continued isolation and study, for example, the cell cycle. 

— 58 — 

BNSDOCJD: <WO 200403931 1A2__L> 



WO 2004/03931 1 PCT/US2003/0 15722 

7. Chips and micro arrays 

363. Disclosed are chips where at least one address is the sequences or part of the sequences set 
forth in any of the nucleic acid sequences disclosed herein. Also disclosed are chips where at least one 
address is the sequences or portion of sequences set forth in any of the peptide sequences disclosed 

5 herein. 

364. Also disclosed are chips where at least one address is a variant of the sequences or part of 
the sequences set forth in any of the nucleic acid sequences disclosed herein. Also disclosed are chips 
where at least one address is a variant of the sequences or portion of sequences set forth in any of the 
peptide sequences disclosed herein. 

10 8. Kits 

365. Disclosed herein are kits that are drawn to reagents that can be used in practicing the 
methods disclosed herein. The kits can include any reagent or combination of reagent discussed herein or 
that would be understood to be required or beneficial in the practice of the disclosed methods. For 
example, the kits could include primers to perform the amplification reactions discussed in certain 

1 5 embodiments of the methods, as well as the buffers and enzymes required to use the primers as intended. 
D. Methods of making the compositions 

366. The compositions disclosed herein and the compositions necessary to perform the disclosed 
methods can be made using any method known to those of skill in the art for that particular reagent or 
compound unless otherwise specifically noted. 

20 1. Nucleic acid synthesis 

367. For example, the nucleic acids, such as, the oligonucleotides to be used as primers can be 
made using standard chemical synthesis methods or can be produced using enzymatic methods or any 
other known method. Such methods can range from standard enzymatic digestion followed by nucleotide 
fragment isolation (see for example, Sambrook et a/., Molecular Cloning: A Laboratoiy Manual, 2nd 

25 Edition (Cold Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y., 1989) Chapters 5, 6) to purely 
synthetic methods, for example, by the cyanoethyl phosphoramidite method using a Milligen or Beckman 
System lPlus DNA synthesizer (for example, Model 8700 automated synthesizer of Milligen-Biosearch, 
Burlington, MA or ABI Model 380B). Synthetic methods useful for making oligonucleotides are also 
described by Ikuta et aL, Amu Rev. Biochem. 53:323-356 (1984), (phosphotriester and phosphite-triester 

30 methods), and Narang et al., Methods EnzymoL, 65:610-620 (1980), (phosphotriester method). Protein 
nucleic acid molecules can be made using known methods such as those described by Nielsen et al, 
Bioconjug. Chem. 5:3-7 (1994). 

2. Peptide synthesis 

368. One method of producing the disclosed proteins, such as SEQ ID NO:23, is to link two or 
35 more peptides or polypeptides together by protein chemistry techniques. For example, peptides or 

polypeptides can be chemically synthesized using currently available laboratory equipment using either 
Fmoc (9-fluorenylmethyloxycarbonyl) or Boc (tert -butyloxycarbonoyl) chemistry. (Applied 
Biosystems, Inc., Foster City, CA). One skilled in the art can readily appreciate that a peptide or 
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polypeptide corresponding to the disclosed proteins, for example, can be synthesized by standard 
chemical reactions. For example, a peptide or polypeptide can be synthesized and not cleaved from its 
synthesis resin whereas the other fragment of a peptide or protein can be synthesized and subsequently 
cleaved from the resin, thereby exposing a terminal group which is functionally blocked on the other 
5 fragment. By peptide condensation reactions, these two fragments can be covalently joined via a peptide 
bond at their carboxyl and amino termini, respectively, to form an antibody, or fragment thereof. (Grant 
GA (1992) Synthetic Peptides: A User Guide. W.H. Freeman and Co., N.Y. (1992); Bodanslcy M and 
Trost B., Ed. (1993) Principles of Peptide Synthesis. Springer- Verlag Inc., NY (which is herein 
incorporated by reference at least for material related to peptide synthesis). Alternatively, the peptide or 
10 polypeptide is independently synthesized in vivo as described herein. Once isolated, these independent 
peptides or polypeptides may be linked to form a peptide or fragment thereof via similar peptide 
condensation reactions. 

369. For example, enzymatic ligation of cloned or synthetic peptide segments allow relatively 
short peptide fragments to be joined to produce larger peptide fragments, polypeptides or whole protein 

15 domains (Abrahmsen L et al., Biochemistry, 30:4151 (1991)). Alternatively, native chemical ligation of 
synthetic peptides can be utilized to synthetically construct large peptides or polypeptides from shorter 
peptide fragments. This method consists of a two step chemical reaction (Dawson et al. Synthesis of 
Proteins by Native Chemical Ligation. Science, 266:776-779 (1994)). The first step is the 
chemoselective reaction of an unprotected synthetic peptide— thioester with another unprotected peptide 

20 segment containing an amino-terminal Cys residue to give a thioester-linked intermediate as the initial 
covalent product. Without a change in the reaction conditions, this intermediate undergoes spontaneous, 
rapid intramolecular reaction to form a native peptide bond at the ligation site (Baggiolini M et al. (1992) 
FEBS Lett. 307:97-101 ; Clark-Lewis I et al„ J.Biol.Chem., 269:16075 (1994); Clark-Lewis I et al., 
Biochemistry, 30:3128 (1991); Rajarathnam K et ah, Biochemistry 33:6623-30 (1994)). 

25 370. Alternatively, unprotected peptide segments are chemically linked where the bond formed 

between the peptide segments as a result of the chemical ligation is an unnatural (non-peptide) bond 
(Schnolzer, M et al. Science, 256:221 (1992)). This technique has been used to synthesize analogs of 
protein domains as well as large amounts of relatively pure proteins with full biological activity (deLisle 
Milton RC et al., Techniques in Protein Chemistry IV. Academic Press, New York, pp. 257-267 (1992)). 

30 3. Process for making the compositions 

37 1 . Disclosed are processes for making the compositions as well as making the intermediates 
leading to the compositions. There are a variety of methods that can be used for making these 
compositions, such as synthetic chemical methods and standard molecular biology methods. It is 
understood that the methods of making these and the other disclosed compositions are specifically 

35 disclosed. 
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E. Methods of using the compositions 

1. Methods of using the compositions as research tools 

372. The disclosed compositions can be used in a variety of ways as research tools. For example, 
the disclosed coordinates and storage media containing them, can be used to isolate molecules that 

5 interact with the TSG1 01 , Vps4A, or Vps4B or GAG molecule, 

373. The compositions can be used for example as targets in combinatorial chemistry protocols 
or other screening protocols to isolate molecules that possess desired functional properties related to 
binding TSG101 , Vps4A, or Vps4B, such that TSG101, Vps4A, or Vps4B cannot interact with retroviral 
GAG proteins and/or inhibit retroviral budding. 

10 F. Examples 

374. The following examples are put forth so as to provide those of ordinary skill in the art with a 
complete disclosure and description of how the compounds, compositions, articles, devices and/or 
methods claimed herein are made and evaluated, and are intended to be purely exemplary of the 
invention and are not intended to limit the scope of what the inventors regard as their invention. Efforts 

15 have been made to ensure accuracy with respect to numbers (e.g., amounts, temperature, etc.), but some 
errors and deviations should be accounted for. Unless indicated otherwise, parts are parts by weight, 
temperature is in °C or is at ambient temperature, and pressure is at or near atmospheric. 
1. Example 1 TsglOl, Vps4, and HIV-1 Budding 
a) Results 

20 (1) Human TsglOl Binds Specifically to the PTAP Motif of HJV-1 p6 

375. Disclosed is a strategy to screen for proteins that could bind to the PTAP late domain 
within HIV-1 p6 and obtained similar results. Briefly, HIV-l NVU/BR5 p6 was used as bait to screen a 
human spleen cDNA prey library for potential binding partners. Genes encoding nearly fUll length 
TsglOl were isolated twice, and were the only genes detected and confirmed in these screens. In 

25 subsequent experiments, it was shown that full length TsglOl bound wild type p6 in directed two-hybrid 
liquid culture assays, resulting in high levels of P-galactosidase activity (>300-fold over background; not 
shown). Three different p6 point mutants (P7L, A9R and P10L) were used to test whether the TsglOl 
binding interaction required the PTAP motif within HIV-1 p6, and all three reduced P-galactosidase 
activity to background levels. Each of these point mutations also arrests HIV-1 budding at a late stage 

30 (Huang et al., 3 995). Hence, Tsgl 01 was judged an attractive candidate for the cellular factor that binds 
the PTAP late domain of HIV-1 p6 and facilitates viral budding. 

376. Directed yeast two-hybrid experiments were further used to map the primary p6 binding site 
to the N-terminal half of TsglOl (residues 1-207, data not shown). Comparative sequence analyses have 
suggested that this region of TsglOl contains a domain that belongs to the ubiquitin enzyme 2 variant 

35 (UEV) protein family (Koonin and Abagyan, 1997; Ponting et ah, 1997; VanDemark et al., 2001). UEV 
proteins are homologous to the E2 class of ubiquitin conjugating enzymes, but lack the active site 
•cysteine residue required to make a transient thioester bond during ubiquitin transfer, and are presumably 
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not enzymatically active. A construct spanning TsglOl residues 1-145 was used to characterize the 

interaction between HI V- 1 ni_4-3 p6 and the TsglOl UEV domain in vitro. Recombinant TsglOl UEV was 

expressed in £. coli, purified to homogeneity, and tested for binding to immobilized GST-p6 fusion 

proteins using a surface plasmon resonance biosensor (Fig. 1). TsglOl UEV exhibited concentration- 

5 dependent p6 binding, and equilibrium responses fit a simple 1 : 1 binding model with an equilibrium 

dissociation constant (K rf ) of 27±5 jxM. As summarized in Fig. 1 A, a series of constructs spanning p6 

residues 1-13, 1-27, 1-33, and 1-52 (full length p6) all bound TsglOl UEV with similar affinities. Thus, 
the primary TsglOl binding site is located within the first 13 residues of p6. 

377. Site-directed mutagenesis of p6 was then used to map the TsglOl binding site more 

10 precisely. Residues 4-13 within full length p6 were substituted individually with alanine (Fig. IE, dark 
bars) or with residues known to block virus release (light bars). Mutations in the four central PTAP 
residues significantly reduced TsglOl binding (>2 kcal/mol loss of binding free energy). Mutations at 
three flanking sites (E6A, PI 1 A and E13A) moderately reduced the binding affinity (0.3-0.5 kcal/mol), 
indicating that these residues also contributed to TsglOl UEV binding. Alanine substitutions at three 

15 other sites (R4A, P5A, and E12A) had no significant effect on TsglOl UEV binding. These experiments 
demonstrate that the TsglOl UEV domain binds directly and specifically to HIV-1 p6 and that the p6 
PTAP motif is the energetically dominant binding epitope. 

(2) Ubiquitin Modification of HIV-1 p6 Enhances TsglOl Binding 

378. The enzymes that conjugate Ub onto HIV-1 Gag are not yet known, so models for p6-Ub 
20 conjugates were created by fusing the Ub protein in frame to the C-termini of full length p6, P633, and 

P627 constructs (Fig. 1 A). Analogous approaches have been employed by others to rescue Ub-dependent 
defects in endocytosis (Shih et al., 2000) and RSV budding (Patnaik et aL, 2000). In all three p6-Ub 
constructs, the presence of ubiquitin increased the affinity of TsglOl UEV binding approximately 10- 
fold (ave. K d =2.3 pM; Figs. 1 A and D). TsglOl UEV even bound weakly to Ub alone (Fig. ID, est 

25 Kd=510±35 juM). These experiments demonstrate that TsglOl UEV, though lacking enzymatic activity, 
has retained Ub binding activity and can bind cooperatively to ubiquitin and to the HIV-1 p6 late domain. 
This is consistent with TsglOl binding ubiquitinated Gag molecules even more tightly in vivo if the UEV 
domain senses the native Ub isopeptide geometry or if oligomerization of full length TsglOl increases its 
avidity for the assembling Gag lattice. 

30 (3) TsglOl is Required for Efficient Release of HIV-1 from 293T Cells 

379. It was important to develop systems to test the requirement for cellular TsglOl in HIV-1 
budding. The HIV-1 p6 late domain can mediate virus release from human embryonic kidney 293T cells 
(Yuan et aL, 2000), and this is confirmed herein that a series of mutations in the p6 PTAP sequence that 
block HIV-1 release and replication in other cell lines also blocked release of infectious HTV-1 particles 

35 in 293T cells. TsglOlwas depleted from 293T cells using small interfering RNAs (siRNA) (Elbashir et 
al, 2001). The system was optimized until it became possible to deplete endogenous TsglOl to nearly 
undetectable levels (Fig. 2A, lanes 2,8,10). Briefly, 293T cells were transfected twice at 24 h intervals 
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with a small interfering RNA duplex homologous to nucleotides 413-433 of the TsglOl coding sequence 
(denoted siRNA). TsglOl levels were analyzed after an additional 24 h by Western blotting of 
cytoplasmic extracts from the bulk culture. Under these conditions, traces of TsglOl protein could only 
be detected in highly overexposed Western blots, whereas levels of control proteins were unaffected. 
5 Nuclear and membrane fractions were also depleted of TsglOl . A heterologous control RNA duplex of 
inverted sequence (denoted siRNA| N v) did not affect cellular TsglOl levels, demonstrating the specificity 
of siRNA. Although others have reported that altering TsglOl levels can affect cell proliferation (Zhong 
et al. s 1998), we observed only a very slight growth reduction in TsglOl -depleted cells over the 72h time 
course. 

10 380. Tsgl 01 protein was reintroduced into TsglOl -depleted cells by co-transfecting a plasmid 

encoding FLAG-tagged TsglOl protein with 7 silent mutations at the target site that rendered the gene 
resistant to RNA interference (denoted Tsg*-FLAG). As shown in Figs. 2A and B, this "resistant" Tsg*- 
FLAG protein was not depleted by siRNA (lanes 4,6). In contrast, endogenous TsglOl and exogenous 
Tsg-FLAG expressed from wt genes were always efficiently depleted by siRNA treatment 

15 38L To test whether TsglOl was required for HIV-1 budding, TsglOl-depIeted cells were co- 

transfected with a proviral HIV-1 R9 expression construct during the second siRNA transfection. 
Depletion of TsglOl very significantly reduced the release of virion-associated MA and CA proteins as 
analyzed in Western blots, and also reduced viral infectivity in single cycle MAGIC infectivity assays 
(Figs. 2C and 2E, lanes 2,8,10). Viral titers were reduced 10- to 50-fold in multiple repetitions of this 

20 experiment. Both virus release and infectivity were restored to normal levels when the Tsg*-FLAG 
protein was reintroduced (lanes 4,6), formally demonstrating that TsglOl is required for the release of 
infectious HIV-1 from 293T cells. Reductions in virus release in the Tsgl 01 -depleted cells did not 
reflect general defects in HIV-1 Gag protein synthesis, stability, or processing (Fig. 2D). However, 
TsglOl depletion reproducibly increased accumulation of the CA-SP1 Gag processing intermediate in the 

25 cytoplasm, consistent with a defect in the final stages of particle assembly (Gottlinger et aL, 1991). 

382. The effect of TsglOl depletion on the release of murine leukemia virus (MLV), which uses 
the alternative "PY" late domain pathway (Yuan et aL, 1999) was analyzed. Mutating the MLV Gag pl2 
PPPY motif (to A AAA) blocked particle release and infectivity (Fig. 3). However, depletion of TsglOl 
had only minor, if any, effect on MLV release and infectivity. These experiments demonstrate that 

30 Tsgl 0 1 -depleted 293T cells can still support replication of retroviruses other than HIV-1, provided they 
are not released via the PTAP-dependent pathway. 

(4) A Functional Vps Pathway Is Required for HIV-1 Release from 
293T Cells 

383, TsglOl functions in the vacuolar protein sorting (Vps) pathway, in which membrane- 

35 associated proteins are sorted through a series of endosomal compartments for eventual degradation in 
the lysosome (Lemmon and Traub, 2000). The effects of other "functional knockouts" of the 
TsglOl/Vps pathway on HIV-1 budding were analyzed. Although TsglOl is predominantly cytosolic at 
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steady state (Xie et ah, 1998; Zhong et al., 1998), the protein appears to cycle dynamically through the 
endosomal network. Another protein in the Vps pathway, Vps4, functions in TsglOl cycling and 
endosomal trafficking (Babst et al, 1998; Bishop and Woodman, 2000; Bishop and Woodman, 2001). 
Vps4 is an ATPase of the AAA protein family that appears to utilize the energy derived from ATP 
5 hydrolysis to disassemble endosome-associated Vps complexes and thereby allow multiple rounds of 
sorting. Overexpressing ATPase-defective GFP-Vps4 fusion proteins induces formation of enlarged 
endosomes (called "class E" compartments in yeast) that are defective in the sorting and recycling of 
endocytosed substrates (Bishop and Woodman, 2000). The Vps4 mutants also prevent normal TsglOl 
trafficking because the protein is trapped on the surface of these aberrant endosomes (Bishop and 

10 Woodman, 2001). 

384. Vectors expressing ATPase-defective GFP-Vps4 proteins were co-transfected together with 
the proviral HIV-1 R9 expression vector. Two different dominant negative mutations were used: K173Q, 
which blocks ATP binding (Vps4 K i 7 3Q), and E228Q, which blocks ATP hydrolysis (Vps4 E2 28Q). Protein 
levels and viral infectivity were again analyzed by Western blotting and in MAGIC assays. As shown in 

15 Figs. 4A and 4B, co-expression of the mutant GFP-Vps4 proteins inhibited particle production in a dose- 
dependent fashion, with release of infectious particles reduced more than 1000-fold at the highest levels 
of Vps4E228Q tested. Co-transfection with the wild type GFP-Vps4 construct reduced particle production 
and infectivity only slightly (-2-3-fold). This reduction was reproducible, however, and may reflect 
stoichiometric imbalances in the Vps pathway arising from Vps4 overexpression. Expression levels for 

20 all of the GFP-Vps4 proteins paralleled the levels of transfected DNA (as detected by anti-GFP Western, 
Fig. 4A, bottom), and none of the Vps4 proteins altered Gag protein synthesis, stability, or cell viability 
significantly (Fig. 4A, middle). Other controls for this experiment included: 1) mock transfections (lane 
1), 2) co-transfection of R9 with a vector expressing GFP only (no effect on budding; lane 2), and 3) 
transfection of an R9 vector encoding a PTAP to LIRL mutant HIV-1 provirus (arrested budding; lane 3). 

25 385. As the Vps4 mutants interfered with endosomal trafficking, the possibility that particle 

release might have been inhibited by altered trafficking, activation, or assembly of viral proteins other 
than Gag (e.g., protease (PR), envelope (Env), or viral protein U) was considered. This was not the case, 
however, because the mutant GFP-Vps4 proteins also blocked release of viral constructs missing PR or 
Env and blocked release of Gag-GFP alone (Fig. 4C, lanes 5, 6). As expected, Gag-GFP was released 

30 efficiently in the absence of the dominant negative Vps constructs, but was blocked by mutation of the 
Gag p6 PTAP sequence (lane 3). These experiments demonstrate that the Vps4-induced block to HIV-1 
release does not require any other viral proteins, consistent with the idea that the p6 domain of the 
assembling Gag particle recruits components of the TsglOl/Vps pathway to assist in budding. 

(5) A Functional Vps Pathway Is Required for Release of Infectious 

35 HIV-1 from Human T Cell Lines 

386. It was also tested whether an intact PTAP motif and a functional Vps pathway were 
required for release of infectious HIV-1 particles from T cells, which are HIV-1 hosts in vivo. Although 
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the transfection efficiencies of T cell lines are low, viral titers produced from MT4 cells 72 h post- 
transfection were sufficiently high to quantitate using the MAGIC assay. Mutation of HIV- 1 p6 PTAP 
reduced viral titers >300-fold (Fig. 4D) and blocked spreading infections in MT4 cultures (growth curves 
not shown). Thus, HIV-1 also requires p6 PTAP to replicate in this non-adherent T cell line. Inhibition 
5 of the Vps pathway by overexpression of the dominant negative Vps4 E22 8Q mutant also severely reduced 
infectious HIV-1 titers (>200-fold), whereas overexpression of the wt Vps4 protein had only modest 
effects (~3-foId reduction). Similar results were obtained in the CEMss T cell line (data not shown). 
Thus, we conclude that HIV-1 requires both a PTAP motif and a functional Vps pathway to replicate in 
multiple cell types, including T cells. 

10 (6) A Functional Vps Pathway Is Required for Release of Infectious 

MLV from 293T Cells 

387. The effects of mutant GFP-Vps4 protein overexpression on MLV particle release and 
infectivity in 293T cells were also tested (Fig. 5). In this case, virion-associated CA release was blocked, 

> with a concomitant increase in cytoplasmic levels of the unprocessed Gag protein (lanes 5, 6). Similarly, 
15 infectivity was dramatically reduced (>800-fold for Vps4e2 2 8q). Thus, the block to viral release imposed 
by the Vps4 mutants is more general than that imposed by depleting TsglOl , which appears to be HIV- 
specific. These observations indicate that late endosomal trafficking, which is inhibited by the dominant 
negative Vps4 proteins, is required for virus release through both the PTAP and PPPY motifs. 

(7) TsglOl Depletion and Vps4 Mutations Inhibit Late Stages of Virus 
20 Budding 

388. Thin section electron microscopy was used to characterize the defects in viral release from 
TsglOl -depleted and Vps4-inhibited cells. In both experiments, HIV-1 budding arrested with "late 

domain" phenotypes that were very similar to those observed for control PTAP" mutant viruses (Fig. 6A). 

In each case, isolated immature particles remained connected to the plasma membrane via membrane 

25 stalks, and budding particles also often formed "clusters" of interconnected virions. Although budding 
HIV-1 particles were occasionally observed in control cells, these "late domain" mutant phenotypes were 
rarely seen. The dominant negative Vps4 proteins also arrested MLV particle release at a late stage, with 
a phenotype that was very similar to that observed for control PPPY~ mutant viruses (Fig. 6B). Thus, 
HIV-1 and MLV Gag proteins localized to the plasma membrane of both TsglOl -depleted and Vps4- 

30 inhibited cells, and appeared to initiate assembly normally. However, the viral particles failed to 

complete membrane fission, implying that TsglOl (for HIV-1) and other components of the Vps pathway 
(for both HIV-1 and MLV) participate in this final stage of viral particle release. 

389. Disclosed herein TsglOl UEV is a multifunctional domain that can simultaneously bind 
both ubiquitin and the p6 PTAP motif, consistent with TsglOl detecting or participating in the Ub 

35 transfer event required for HIV-1 budding. Disclosed herein the three-dimensional structure of TsglOl 
UEV exhibits an E2-like fold, but that the final two helices in the canonical E2 fold are replaced by the 
PTAP binding site. 
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b) Experimental Procedures 

(1) Yeast Two-hybrid Experiments 

390. Yeast two-hybrid reagents and techniques have been reviewed (Bartel and Fields, 1997). 
Briefly, a bait construct expressing HIV-Wbrs p6 (Gag residues 449-500) fused to the C-terminus of 
Gal4 DNA binding domain (residues 1-147) was transformed into yeast strain PNY200 {MATatrpl-901 
leu2-3, 1 1 2 wa3-52 his3-200 ade2 gal4A galSO). Prey constructs expressing cDNA from human spleen 
poly(A) + RNA (Clontech) fused to the C-terminus of the Gal4 activation domain (residues 768-881) were 
transformed into the yeast strain BK100 (MATa trpl-901 Ieu2-3,J12 ura3-52 his3-200 gaUA galSO 
L YS2: : GAL-HIS3 GAL2-ADE2 met2::GAL7-lacZ), which incorporates the multiple reporter system 
(James et al., 1996). PNY200 cells (bait) were mated with BK100 cells (preys) and diploid yeast cells 
were selected in the presence of 3 mM 3-amino-l,2,4-triazole for the ability to synthesize tryptophan 
(bait), leucine (prey), histidine (bait/prey interaction), and adenine (bait/prey interaction). Approximately 
4.2 million bait/prey pairs were tested, and preys encoding TsglOl residues 7-390 were isolated twice 
(TsglOl NCBI accession # NM_006292). 

391. Interactions were confirmed by transforming bait and prey constructs into naive yeast cells 
and performing liquid culture P-galactosidase assays. Cultures were grown overnight in synthetic media 
(-Leu, -Trp, + glucose) in 96 well plates, normalized for optical density, and lysed by addition of 6X 
lysis/substrate solution in 6X Z-buffer (60mM KC1, 6mM MgS0 4 , 360mM Na 2 HP0 4) 240 mM NaH 2 P0 4 , 
6mg/ml CPRG, 0.12U/ml lyticase, 0.075% NP-40). Cultures were incubated for 2 hr at 37°C, clarified 
by centrifugation, and the optical absorbance of each supernatant was measured (575 nm). Significant 
responses were observed for p6 binding to full length TsglOl (>300-fold) and an N-terminal TsglOl 
construct (residues 1-207; >30-fold over background), but not to a C-terminal TsglOl construct (residues 
207-390). 

(2) Recombinant Proteins 

392. DNA encoding TsglOl residues 1-145 (the UEV domain) was cloned into pETl Id 
(Novagene), expressed in BL21(DE3) E. coli cells, and purified by conventional chromatography. Full 
details of the purification procedure will be published elsewhere (Pornillos et al., in preparation). 
Expression of mutant GST-p6 NL4 - 3 and GST-Ub proteins followed the procedures described for wt GST- 
p6. (Jenkins et al., 2001). 

(3) Surface Plasmon Resonance 

393. Surface plasmon resonance measurements were performed at 20°C using a BIACORE 3000 
(Biacore AB, Uppsala, Sweden) equipped with a research-grade CM5 sensor chip. -10 kRU anti-GST 
Ab was immobilized on all four flow cells using traditional amine-coupling chemistry (Johnsson et al., 
1991). Soluble lysates from E. coli expressing GST-p6 proteins were diluted 10X in running buffer (20 
mM Na 3 P0 4 , 150 mM NaCl, 0.005% P20, 50 ug/mL BSA, pH 7.2) and captured individually on three of 
the antibody surfaces at densities of 1-2.5 kRU. ~2 kRU recombinant GST protein was captured on the 
fourth (reference) antibody surface. 
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394. TsglOl UEV in running buffer was injected in triplicate over the four flow cells at 

concentrations of 0, 0.09, 0.27, 0.82, 2.5, 7.4, 22, 67, and 200 \xM (50 (iL/min). Data were collected at a 
rate of 2 Hz during the 30 s association and dissociation phases. All Tsg 101/p6 interactions reached 
equilibrium rapidly and dissociated completely within seconds. To obtain equilibrium dissociation 
5 constants, the responses at t = 25 s were fit to simple 1:1 interaction binding isotherms (Myszka, 1999). 

(4) Plasmid Constructs 

395. Vector R9 contains a full-length HIV-l NL 4-3 expression clone (von Schwedler et al., 1998). 
Kunkel mutagenesis was used to mutate the HIV-1 Gag p6 PTAP sequence to LIRL without disrupting 
the overlapping pol reading frame (Huang et al., 1995). R9AEnv and R9APR were gifts from Chris 

1 0 Aiken, Vanderbilt University. pGag-GFP contains the rev-independent HIV- 1 hxb2 Gag sequence fused 
to EGFP, and was a gift from Marilyn Resh (Hermida-Matsumoto and Resh, 2000). The PTAP sequence 
of Gag p6 was also mutated to LIRL in this construct. The packaging vector pCLeco was used for MLV 
production (Naviaux et al., 1996). The MLV Gag pl2 sequence PPPY was mutated to AAAA (Yuan et 
al, 1999) by PCR megaprimer mutagenesis of pCLeco. DNA encoding full length TsglOl was fused to 

15 the FLAG coding sequence using PCR and cloned into pIRES2-EGFP (Clontech). Seven silent mutations 
that render TsglOl -FLAG resistant to siRNA (denoted Tsg*-FLAG in Fig. 2) were introduced into the 
wild type TsglOl coding region 4 1 3 A ACCTCC AGTCTTCTCTCGTC433 using Kunkel mutagenesis 
(mutated nucleotides are underlined). The complete 437aa reading frame of Vps4 was repaired and 
amplified from ATCC #8 1 449, and cloned into pEGFP-Cl (Clontech) to create a GFP-Vps4 fusion 

20 protein. ATPase point mutations K3 73Q and E228Q were introduced by PCR megaprimer mutagenesis. 
Cloning details are available upon request 

(5) Small Interfering RNA (siRNA) 

396. 21-nt RNA duplexes with symmetric 2-nt 3' (2'-deoxy) thymidine overhangs corresponding 
to TsglOl coding nt 413-433 were synthesized and HPLC purified. RNA sequences: Sense, 5' CCU CCA 

25 GUC UUC UCU CGU CTT; Antisense, 5' GAC GAG AGA AGA CUG GAG GTT; Inverted sense, 5' 
CUG CUC UCU UCU GAC CUC CTT; Inverted antisense, GGA GGU CAG AAG AGA GCA GTT. 
Oligonucleotides were annealed as described (Elbashir et ah, 2001). 

(6) Protein Expression in 293T and T Cells 

397. 293T cells were co-transfected with the indicated amounts of R9 and Vps4 expression 
30 vectors in 6 well plates using the calcium phosphate method as described (von Schwedler et al., 1998). 

All other transfections were performed as described in the text using Lipofectamine2000 (Invitrogen) 
following the manufacturer's instructions. Cells and supernatants were harvested 24-72h post transfection 
as described in the text. 

(7) Western Blots 

35 398. Cytoplasmic and sucrose-pelleted viral lysates were resolved by SDS-PAGE and blotted for 

ECL as described (von Schwedler et al., 1998). The following primary antibodies were used: rabbit anti- 
HIV CA antibody from Hans-Georg Krausslich, Heidelberg, Germany (at 1 :2000); rabbit anti-HIV MA 
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from Didier Trono, Geneva, Switzerland (at 1:50,000); murine monoclonal anti-Tsgl01-4A10 from 
GeneTex, Inc. (at 1:1000), murine monoclonal anti-FLAG M2 from Sigma (at 1:3000), rabbit anti-14-3- 
3(5 K19 from Santa Cruz Biotechnology (at 1:3000), anti-GFP-HRP from Clontech (at 1:500), and goat 
anti-MLV p30 from John Elder, Scripps Institute, La Jolla (at 1:1000), 
5 (8) Viral Replication Assays 

399. Infectivity of HIV- 1 released into the supematants from cells transfected with R9 constructs 
was assayed by MAGIC assay as described (von Schwedler et al., 1998) except that infections were 
performed at 3 different dilutions in triplicate in 48 well plates. Blue cells and syncytia were counted 2d 
after infection. MLV infectivity was assayed by packaging pCL-lacZ in the M-MLV retroviral vector 

10 pCLeco (Naviaux et al., 1996), transducing NIH3T3 cells, and staining cells for p-galactosidase activity 2 
days later, as described (von Schwedler et aL, 1998). 

(9) Electron Microscopy 

400. Transfected 293T cells were fixed with 2.5% glutaraldehyde/ 1% paraformaldehyde for 
30min and stained as described (von Schwedler et al., 1998), except that samples were dehydrated in 

15 ethanol and embedded in Spurr's plastic. 

2. Example 2 Vps4b 

401. The human Skdl/Vps4b gene was cloned using standard cloning technology based on the 
homology of the yeast Skdl gene. Variant Vps4b nucleic acids were produced using standard 

20 recombinant biotechnology to produce the orthologous mutations K180Q and E235Q. These were made 
as DsRed fusion proteins. The K180Q and E235Q mutations were assayed for their involvement in HIV 
budding (Figure 7). A Rev-independent GagGFP construct (point cyt anti-CA) was used (Dr. Marylin 
Reshconstruct). 293T cells were co-transfected with the an expressable Vps4b nucleic acid and an 
expressable GagGFP construct. Upon co-transfection with the DsRed-Skdl fusion genes gag budding 

25 was blocked by the mutants, but promoted by wild type Vps4b. In immunofluorescence, gag-GFP is 
more than usually stuck at the membrane. 

402. Not only can the Vps4b mutants, K180Q and E235Q block GAG budding they can also 
block viral budding and viral infectivity. The mutant Vps constructs were cotransfected with HIV virus 
in cells. Western analysis showed that production of mature GAG proteins, CA and MA was blocked by 

30 the K180Q and E235Q mutant Vps4b constructs, as well as the E228Q mutant Vps4a construct (Figure 
8A). This indicates that a block in the GAG processing and budding was caused by both VPs4a and 
Vps4b mutants. Consistent with this result, a MAGIC infectivity analysis showed that the Vps4a and 
Vps4b mutants blocked production of infectious viral particles as determined by the MAGIC assay 
(Figure 8B). 

35 403. Vps4a and Vps4b both bind Bc2. GST pulldown experiments showed that Bc2 

coprecipitated with both Vps4a and Vps4b (Figure 9A). Overexpression of a DsRed-Bc2 fusion protein 
along with either cotransfected HIV GAG or HIV virus, showed that production of mature GAG particles 
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(Figure 9B) and budding of viral particles (Figure 9C). was blocked by the overexpression of the Bc2. 
Overexpression of cytoplasmic GAG and VPS-25, another yeast ortholog protein did not effect GAG 
maturation or viral budding (Figure 9B and C). Furthermore, overexpression of Bc2 blocked production 
of infectious viral particles as determined by a MAGIC assay. (Figure 9D). This is consistent with Bc2 
5 causing a stoichiometric shift in Vps4 associations which prevents Vps4 associations needed for GAG 
maturation and viral budding. 
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EN.REFLIST 
15 H. Sequences 

1. SEQ ID NO:l TSG101 protein sequence one example Genbank accession number 
Q99816. 

2. SEQ ID NO:2 DNA for TSG101 NM_006292. human TSG101 gene 

3. SEQ ID NO:3 TSG101 protein encoded by SEQ ID NO:2 

20 4. SEQ ID NO:4. Vps4A protein Genbank accession number AAG01470, 

5- SEQ ID NO:5: Vps4A Protein Genbank accession number NP_037377 

6. SEQ ID NO:6 Vps4B Protein Genbank Acession Number AAG01471. 

7. SEQ ID NO:7 Vps4 Mouse Genbank Accession No. NP__ 569053 

8. SEQ ID NO:8 Vps4B gene Genbank Accession Number AF282904 
25 9. SEQ ID NO:9 Vps4B protein encoded by SEQ ID NO:8 AF282904 

10. SEQ ID NO:10 Vps4A human gene Genbank accession number AF282903 

11. SEQ ID NO:ll Vps4A Protein encoded by SEQ ID NO:10 AF282903 
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VI. CLAIMS 

What is claimed is: 

I. A method for reducing interactions between TSG101 and Vps4A, Vps4B, Ub, or Bc2 , comprising 
incubating an inhibitor of the interaction between TSG101 and Vps4A, Vps4B, Ub, or Bc2. 

5 2. A method for inhibiting HIV budding comprising administering an inhibitor of the interaction 

between TSG 1 0 1 and Vps4A, Vps4B, Ub, or Bc2 . 

. 3. A method of treating a subject comprising administering to the subject an inhibitor of HIV 
budding, wherein the inhibitor reduces the interaction between TSG 101 and Vps4A, Vps4B, Ub, or Bc2 , and 
wherein the subject is in need of such treatment. 

10 4. A method of identifying an inhibitor of an interaction between TSG101 and Vps4A, Vps4B, Ub, 

or Bc2 comprising incubating a library of molecules with TSG 101 forming a mixture, and identifying the 
molecules that disrupt the interaction between TSG101 and Vps4A, Vps4B, Ub, or Bc2, wherein the 
interaction disrupted comprises an interaction between the Vps4A, Vps4B, Ub, or Bc2 and an amino acid of 
TSG101. 

15 5. The method of claim 5, wherein the step of isolating comprises incubating the mixture with 

molecule comprising a Vps4A, Vps4B, Ub, orBc2 variant or fragment thereof 

6. A method of identifying an inhibitor of an interaction between TSG 101 and Vps4A, Vps4B, Ub, 
or Bc2 comprising incubating a library of molecules with Vps4A, Vps4B, Ub, or Bc2 forming a mixture, and 
identifying the molecules that disrupt the interaction between Vps4A, Vps4B, Ub, or Bc2 and TSG101, 

20 wherein the interaction disrupted comprises an interaction between the Vps4A, Vps4B, Ub, or Bc2 and an 
amino acid of TSG 101. 

7. The method of claim 6, wherein the step of isolating comprises incubating the mixture with 
molecule comprising a TSG 101 UEV domain. 

8. A composition identified by the process of claims 4 or 6, 

25 9. A composition capable of being identified by the process of claim 4 or 6. 

10. The method of claims 4 or 6, further comprising incubating the mixture with at least one of 
Vps4A, Vps4B, Ub, or Bc2. 

I I . A method of manufacturing a composition for inhibiting the interaction between TSG101 and 
Vps4A, Vps4B, Ub, or Bc2 comprising synthesizing the inhibitor of claims 4 or 6. 

30 12. The method of claim 1 1 further comprising mixing a pharmaceutical carrier with the inhibitor. 

13. A method of manufacturing a composition for inhibiting the interaction between TSG101 and 
GAG comprising admixing the inhibitor with a pharmaceutical carrier. 

14. The method of claims 1-3 wherein the inhibitor prevents expression of TSG101 mRNA. 

15. The method of claims 1-3, wherein the inhibitor interacts with the mRNA of TSG101. 
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16. The method of claim 1-3, wherein the inhibitor- is an interfering RNA. 

18. The method of claim 1-3, wherein the interfering RNA has the sequence set forth in SEQ ID 

NO:3. 

19. A method for inhibiting viral budding in a virus comprising proteins that interact with TSG101, 
5 comprising administering an inhibitor of the Vps pathway in a cell. 

20. The method of claim 19, wherein the virus is HIV or Ebola. 

21. The method of claim 19, wherein the inhibitor prevents normal trafficking of TSG101. 

22. The method of claim 19, wherein the inhibitor reduces the release of TSG101 from endosomes. 

23. The method of claim 19, wherein the inhibitor prevents normal Vps4 function. 
10 24. The method of claim 19, wherein the inhibitor interacts with Vps4. 

25. A method for reducing TSG101 function, comprising incubating an inhibitor of the kinase 
activity of Vps4A or Vps4B. 

26. A method for inhibiting HIV budding comprising administering an inhibitor of the kinase activity 
of Vps4A or Vps4B. 

15 27. A method of treating a subject comprising administering to the subject an inhibitor of HIV 

budding, wherein the inhibitor reduces the kinase activity of Vps4AorVps4B, and wherein the subject is in 
need of such treatment. 

28. A method of identifying an inhibitor of Vps4A or 4B ATPase activity comprising, a) 
administering a composition to a system, wherein the system supports Vps4A or 4B ATPase activity, b) 

20 assaying the effect of the composition on the amount of Vps4A or 4B ATPase activity in the system, and c) 

selecting a composition which causes a decrease in the amount of Vps4A or 4B ATPase activity present in the 
system relative to the system without the addition of the composition. 

29. The method of claim 28, wherein the system comprises retroviral budding activity and the Vps4A 
or 4B ATPase activity is determined by assaying the retroviral budding activity. 

25 30. A method of identifying an inhibitor of HIV budding comprising, a) administering a composition 

to a system, wherein the system supports HIV budding via a Vps4A or 4B ATPase activity, b) assaying the 
effect of the composition on the amount of HIV budding in the system, and c) selecting a composition which 
causes a decrease in the amount of HIV budding present in the system because of an inhibition of the Vps4A 
or 4B ATPase activity relative to the system without the addition of the composition. 

30 31 . A method of inhibiting HIV budding comprising administering a composition, wherein the 

composition prevents HIV budding, wherein the composition is defined as a composition capable of being 

identified by administering the composition to a system, wherein the system supports HIV -budding via a 

Vps4A or 4B ATPase activity interaction, assaying the effect of the composition on the amount of HIV 

budding in the system, and selecting a composition which causes a decrease in the amount of HIV budding 

35 present in the system because of an inhibition of the Vps4A or 4B ATPase activity relative to the system 
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without the addition of the composition. 

32. A method of inhibiting HIV budding comprising administering a composition that reduces 
Vps4A or 4B ATPase activity. 

33, A method of making a composition capable of inhibiting HIV budding comprising admixing a 
5 compound with a pharmaceutical^ acceptable carrier, wherein the compound is identified by administering 

the compound to a system, wherein the system supports HIV budding via a Vps4A or 4B ATPase activity, 
assaying the effect of the compound on the amount of HIV budding in the system, and selecting a compound 
which causes a decrease in the amount of HIV budding in the system because of an inhibition of the Vps4A or 
4B ATPase activity, relative to the system without the addition of the compound. 

10 34. A method of manufacturing an inhibitor to HIV budding comprising, a) administering a 

composition to a system, wherein the system supports HIV budding via a Vps4A or 4B ATPase activity, b) 
assaying the effect of the composition on the amount of HIV budding in the system, c) selecting a composition 
which causes a decrease in the amount of HIV budding present in the system because of an inhibition of the 
Vps4A or 4B ATPase activity, relative to the system with the addition of the composition, and d) synthesizing 

1 5 the composition. 

35. The method of claim 34, further comprising the step of admixing the composition with a 
pharmaceutical carrier. 

36. A method of identifying an inhibitor of Vps4A or 4B ATPase activity comprising a) 
administering a composition to a system, wherein the system comprises Vps4A or 4B ATPase activity, b) 

20 assaying the effect of the composition on a Vps4A or 4B ATPase activity, and c) selecting a composition 
...which inhibits a Vps4A or 4B ATPase activity. 

37. A cells comprising, a) a regulatable nucleic acid comprising sequence encoding a Vps4A or 4B 
gene and b) a nucleic acid comprising sequence encoding a budding retrovirus as well as cells further „ 
comprising a regulatable nucleic acid comprising sequence encoding a TSG101 gene. 

25 38. Disclosed are cells that further comprising an inhibitor of a Vps4A or 4B ATPase activity. 

39. A method of inhibiting HIV budding comprising, administering an dominant negative mutant of 
Vps4A, wherein the dominant negative mutant substitutes a Q at position 228 of SEQ ID NO:l 1. 
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SEQUENCE LISTING 

<110> University of Utah Research Foundation 
Sundquist, Wesley I. 
Garrus, Jennifer E . 
von Schwedler, Uta EC. 

<120> HUMAN VPS4A AND VPS4B FUNCTIONS IN 
RETROVIRAL BUDDING 



<130> 21101. 0029P1 

<150> 60/382, 702 
<151> 2003-05-21 

<150> 60/381,027 
<151> 2002-05-15 



<160> 11 



<170> FastSEQ for Windows Version 4.0 

<210> 1 
<211> 390 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : /note 
synthetic construct 



<400> 1 
















Met 
1 


Ala 


Val 


Ser 


Glu 
5 


Ser 


Gin 


Leu 


Lys 


Tyr 


Arg 


Asp 


Leu 


Thr 


Val 


Arg 


Glu 


Thr 








20 










25 


Lys 


Asp 


Leu 


Lys 


Pro 


Val 


Leu Asp 


Ser 






35 










40 




Ser 


Arg 


Glu 


Leu 


Met 


Asn 


Leu 


Thr 


Gly 




50 










55 




Gly 


Asn 


Thr 


Tyr Asn 


He 


Pro 


He 


Cys 


65 










70 






Pro 


Tyr 


Asn 


Pro 


Pro 


lie 


Cys 


Phe 


Val 










85 










He 


Lys 


Thr 


Gly 


Lys 


His 


Val 


Asp Ala 








100 










105 


Tyr 


Leu 


His 


Glu 


Trp 


Lys 


His 


Pro 


Gin 






115 








120 




Gin 


Val 


Met 


He 


Val 


Val 


Phe 


Gly Asp 




130 










135 






Pro 


He 


Ser 


Ala 


Ser 


Tyr 


Pro 


Pro 


Tyr 


145 










150 






Thr 


Ser 


Tyr 


Met 


Pro 


Gly 


Met 


Pro 


Gly 










165 










Gly 


Tyr 


Pro 


Pro 


Asn 


Pro 


Ser 


Gly Tyr 








180 










185 


Gly Gly Pro 


Tyr 


Pro 


Ala 


Thr 


Thr 


Ser 






195 










200 





10 15 
Val Asn Val lie Thr Leu Tyr 
30 

Tyr Val Phe Asn Asp Gly Ser 
45 

Thr He. Pro Val Pro Tyr Arg 
60 

Leu Trp Leu Leu Asp Thr Tyr 

75 80 
Lys Pro Thr Ser Ser Met Thr 
90 95 
Asn Gly Lys He Tyr Leu Pro 
110 

Ser Asp Leu Leu Gly Leu lie 
125 

Glu Pro Pro Val Phe Ser Arg 
140 

Gin Ala Thr Gly Pro Pro Asn 
155 160 
Gly He Ser Pro Tyr Pro Ser 
170 175 
Pro Gly Cys Pro Tyr Pro Pro 
190 

Ser Gin Tyr Pro Ser Gin Pro 
205 
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Pro 


Val 
210 


Thr 


Thr 


Val 


Gly 


Pro 
215 


Ser 


Thr 


lie 


Arg 


Ala 


Ser 


Leu 


He 


Ser 


225 










230 






Arg 


Met 


Lys 


Glu 


Glu 
245 


Met 


Asp 


Arg 


Lys 


Arq 


Thr 


Glu 

260 


Glu 


Asp 


Leu 


Lys 


Met 


Val 


Thr 
275 


Arg 


Leu 


Asp 


Gin 


Glu 
280 


Glu 


Leu 
290 


Leu 


Lys 


Lys 




Asp 
295 


Glu 


Met 


Glu 


Asn 


Gin 


Ser 


Glu 


Asn 


Asn 


305 










310 






Thr 


Ala 


Pro 


Leu 


Tyr 
325 


Lys 




lie 


Ala 


lie 


Glu 


Asp 
340 


Thr 


He 


r lie 


i yr 


Val 


He 


Asp 
355 


Leu 


Asp 


Val 


Phe 


Leu 
360 


Lys 


Gin 


Phe 


Gin 


Leu Arg 


Ala 


Leu 




370 










375 




Gly 


Leu 


Ser 


Asp 


Leu 


Tyr 






385 










390 







Arg Asp 


Gly 


Thr 


lie 


Ser 


Glu Asp 








220 










Ala 


Val 


Ser 


Asp 




Leu 


ZX y n 


J- 1 






235 










24 0 


Ala 


Gin 


Ala 


Glu 


Leu 


Don 


Ala 
nl d 


•uc U 




250 










2 ^ S 

Z. J ^ 




Lys 


Gly 


His 


Gin 


Lys 


Leu 


Old 


vJlll 


i< U J 










?7 n 






Val 


Ala 


Glu 


v a x 




Lys 


.Tito 11 


lie 










285 








Glu 


Leu 


Ser 


JC1 


Al d 


_ 

Leu 


HIT 11 

oiU 


Lys 








?no 










Asp 


He 


fl on 


C~\ n 
blu 


val 


lie 


Tip 
lie; 








315 










320 


Leu 


Asn 


Leu 


Tyr 


Ala 


Glu 


Glu 


Asn 




330 










335 




Leu 


Gly 


Glu 


Ala 


Leu 


Arg 


Arg 


Gly 


345 










350 






Lys 


His 


Val 


Arg 


Leu 


Leu 


Ser 


Arg 










365 








Met 


Gin 


Lys 


Ala 


Arg 


Lys 


Thr 


Ala 



380 



<210> 2 
<211> 1550 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence : /note = 
synthetic construct 

<400> 2 

gtagtggtgc cgacttcctg ttgtttgagg ccgggttggg ggtgtgcgat tgtgtgggac 60 

ggtctggggc agcccagcag cggctgaccc tctgcctgcg gggaagggag tcgccaggcg 120 

geegtcatgg cggtgtcgga gagccagctc aagaaaatgg tgtccaagta caaatacaga 18 0 

gacctaactg tacgtgaaac tgtcaatgtt attactctat acaaagatct caaacctgtt 24 0 

ttggattcat atgtttttaa cgatggcagt tccagggaac taatgaacct cactggaaca 300 

atccctgtgc cttatagagg taatacatac aatattccaa tatgectatg gctactggac 360 

acatacccat ataatccccc tatctgtttt gttaagecta ctagttcaat gactattaaa 4 20 

acaggaaagc atgttgatgc aaatgggaag atatatcttc cttatctaca tgaatggaaa 4 80 

cacccacagt cagacttgtt ggggcttatt caggtcatga ttgtggtatt tggagatgaa 54 0 

cctccagtct tctctcgtcc tattteggea tcctatccgc cataccaggc aaeggggeca 600 

ccaaatactt cctacatgcc aggcatgeca ggtggaatct ctccataccc ateeggatae 660 

cctcccaatc ccagtggtta cccaggctgt ccttacccac ctggtggtcc atatcctgcc 720 

acaacaagtt ctcagtaccc ttctcagcct cctgtgacca ctgttggtcc cagtagggat 780 

ggcacaatca gcgaggacac catccgagcc tctctcatct ctgcggtcag tgacaaactg 8 40 

agatggcgga tgaaggagga aatggatcgt gcccaggcag agetcaatge cttgaaacga 900 

acagaagaag acctgaaaaa gggtcaccag aaactggaag agatggttac ccgtttagat 960 

caagaagtag ccgaggttga taaaaacata gaacttttga aaaagaagga tgaagaactc 1020 

agttctgctc tggaaaaaat ggaaaatcag tctgaaaaca atgatatcga tgaagttatc 1080 

attcccacag ctcccttata caaacagatc ctgaatctgt atgeagaaga aaaegctatt 1140 

gaagacacta tcttttactt gggagaagee ttgagaaggg gcgtgataga cctggatgtc 1200 

ttcctgaagc atgtacgtct tctgtcccgt aaacagttcc agctgagggc actaatgeaa 1260 

aaagcaagaa agactgeegg tctcagtgac ctctactgac ttctctgata ccagctggag 1320 

gttgagctct tcttaaagta ttcttctctt ccttttatca gtaggtgccc agaataagtt 1380 

attgcagttt atcattcaag tgtaaaatat tttgaatcaa taatatattt tctgttttct 14 40 
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tttggtaaag actggctttt attaatgcac tttctatcct ctgtaaactt tttqtqctoa l^on 
atgttgggac tgctaaataa aatttgttgc ataaaaaaaa aaaaaaaaaa tttgt ^ ct ^ 1500 



<210> 3 
<211> 390 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : /note - 
synthetic construct 

<400> 3 

Met Ala Val Ser Glu Ser Gin Leu Lys Lys Met Val Ser Lys Tyr Lys 

Tyr Arg Asp Leu Thr Val Arg Glu Thr "l Asn Val He Thr Jeu Tyr 

t 25 30 

Lys Asp Leu Lys Pro Val Leu Asp Ser Tyr Val Phe Asn Asp Gly Ser 

Ser Arg Glu Leu Met Asn Leu Thr Gly Thr He Pro ill Pro Tyr Arg 

55 60 
Gly Asn Thr Tyr Asn lie Pro He Cys Leu Trp Leu Leu Asp Thr Tyr 

Pro Tyr Asn Pro Pro He Cys Phe Val Lys Pro Thr Ser Ser Met IL 

85 90 95 

He Lys Thr Gly Lys His Val Asp Ala Asn Gly Lys lie Tyr Leu Pro 

105 110 
Tyr Leu His Glu Tr P Lys His Pro Gin Ser Asp Leu Leu Gly Leu lie 

120 125 
Gin val Met lie Val Val Phe Gly Asp Glu Pro Pro Val Phe Ser Arg 
- LJ0 135 140 * 

Pro He Ser Ala Ser Tyr Pro Pro Tyr Gin Ala Thr Gly Pro Pro Asn 

150 155 i an 

Thr Ser Tyr Met Pro Gly Met Pro Gly Gly lie Ser Pro Tyr Pro Ser 

^ m 165 170 175 

Gly Tyr Pro Pro Asn Pro Ser Gly Tyr Pro Gly Cys Pro Tyr Pro Pro 

185 190 
Gly Gly Pro Tyr Pro Ala Thr Thr Ser Ser Gin Tyr Pro Ser Gin Pro 

Pro Val Thr Thr Val Gly Pro Ser Arg Asp Gly Thr He Ser Glu Asp 

215 220 

225 SSr J?" 116 Ser Ala Val Ser Asp Lys Leu Arg Trp 

235 *?/in 
Arg Met Lys Glu Glu Met Asp Arg Ala Gin Ala Glu Leu Asn Ala Leu 

245 250 ore: 

Lys Arg Thr Glu Glu Asp Leu Lys Lys Gly His Gin Lys Leu Glu Glu 



260 265 270 

Lys 

Glu Leu Leu Lys Lys Lys Asp Glu Glu Leu Ser Ser Ala Leu Glu Lys 



Met Val Thr Arg Leu Asp Gin Glu Val Ala Glu Val Asp Lys Asn He 
/3 28° 285 



295 3 00 
Met Glu Asn Gin Ser Glu Asn Asn Asp lie Asp Glu Val He He Pro 
JU3 310 315 



Thr Ala Pro Leu Tyr Lys Gin He Leu Asn Leu Tyr Ala Glu Glu Asn 



325 330 
Thr He Phe Tyr Leu Gly 
345 

Val He Asp Leu Asp Val Phe Leu Lys His Val Arg Leu tlu Ser Arg 



Ala He Glu Asp Thr He Phe Tyr Leu cly Glu Ala Leu Arg Arg Gly 

34 5 350 



365 
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Lys Gin Phe Gin Leu Arg Ala Leu Met Gin Lys Ala Arg Lys Thr Ala 

370 375 380 

Gly Leu Ser Asp Leu Tyr 
385 390 

<210> 4 
<211> 437 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : /note = 
synthetic construct 

<400> 4 

Met Thr Thr Ser Thr Leu Gin Lys Ala lie Asp Leu Val Thr Lys Ala 

1 5 10 . 15 

Thr Glu Glu Asp Lys Ala Lys Asn Tyr Glu Glu Ala Leu Arg Leu Tyr 

20 25 30 

Gin His Ala Val Glu Tyr Phe Leu His Ala He Lys Tyr Glu Ala His 

35 40 45 

Ser Asp Lys Ala Lys Glu Ser He Arg Ala Lys Cys Val Gin Tyr Leu 

50 55 60 

Asp Arg Ala Glu Lys Leu Lys Asp Tyr Leu Arg Ser Lys Glu Lys His 
65 70 75 80 

Gly Lys Lys Pro Val Lys Glu Asn Gin Ser Glu Gly Lys Gly Ser Asp 

8 5 90 95 

Ser Asp Ser Glu Gly Asp Asn Pro Glu Lys Lys Lys Leu Gin Glu Gin 

100 105 HO 

Leu Met Gly Ala Val Val Met Glu Lys Pro Asn He Arg Trp Asn Asp 

115 120 125 

Val Ala Gly Leu Glu Gly Ala Lys Glu Ala Leu Lys Glu Ala Val He 

130 135 140 

Leu Pro He Lys Phe Pro His Leu Phe Thr Gly Lys Arg Thr Pro Trp 
145 150 155 160 

Arq Gly He Leu Leu Phe Gly Pro Pro Gly Thr Gly Lys Ser Tyr Leu 

165 170 175 

Ala Lys Ala Val Ala Thr Glu Ala Asn Asn Ser Thr Phe Phe Ser Val 

180 185 190 

Ser Ser Ser Asp Leu Met Ser Lys Trp Leu Gly Glu Ser Glu Lys Leu 

195 200 205 

Val Lys Asn Leu Phe Glu Leu Ala Arg Gin His Lys Pro Ser He He 

210 215 220 

Phe He Asp Glu Val Asp Ser Leu Cys Gly Ser Arg Asn Glu Asn Glu 
225 * 230 235 240 

Ser Glu Ala Ala Arg Arg He Lys Thr Glu Phe Leu Val Gin Met Gin 

245 " 250 255 

Gly Val Gly Asn Asn Asn Asp Gly Thr Leu Val Leu Gly Ala Thr Asn 

260 265 270 

He Pro Trp Val Leu Asp Ser Ala He Arg Arg Arg Phe Glu Lys Arg 

275 280 285 

He Tyr He Pro Leu Pro Glu Glu Ala Ala Arg Ala Gin Met Phe Arg 

290 295 300 

Leu His Leu Gly Ser Thr Pro His Asn Leu Thr Asp Ala Asn He His 
305 " 310 315 320 

Glu Leu Ala Arg Lys Thr Glu Gly Tyr Ser Gly Ala Asp He Ser He 

325 330 335 

He Val Arg Asp Ser Leu Met Gin Pro Val Arg Lys Val Gin Ser Ala 
340 345 350 
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Thr 


His 


Phe 
355 


Lys 


Lys 


Val 


Cys 


Gly 
360 


Pro 


Ser 


Arg 


Thr 


Asn 
365 


Pro 


Ser 


Met 


Met 


He 


Asp 


Asp 


Leu 


Leu 


Thr 


Pro 


Cys 


Ser 


Pro 


Gly Asp 


Pro 


Gly Ala 




370 










375 










380 










Met 


Glu 


Met 


Thr 


Trp 


Met 


Asp 


Val 


Pro 


Gly Asp 


Lys 


Leu 


Leu 


Glu 


Pro 


385 










390 










395 










400 


Val 


Val 


Cys 


Met 


Ser 
405 


Asp 


Met 


Leu 


Arg 


Ser 
410 


Leu 


Ala 


Thr 


Thr 


Arg 
415 


Pro 


Thr 


Val 


Asn 


Ala 
420 


Asp 


Asp 


Leu 


Leu 


Lys 
425 


Val 


Lys 


Lys 


Phe 


Ser 
430 


Glu 


Asp 


Phe 


Gly 


Gin 
435 


Glu 


Ser 

























<210> 5 
<211> 437 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : /note = 
synthetic construct 



<400> 5 



Met 


Thr 


Thr 


Ser 


Thr 


Leu 


Gin 


Lys 


Ala 


. -L JL tr 


7\ or^k 
n o ^.J 


Leu 


Vdl 


I ill 


Lys 


■H.j.a 


1 








5 










10 










1 S 




Thr 


Glu 


Glu 


Asp 


Lys 


Ala 


Lys 


As n 


T\/T 


Glu 






Leu 


Arg 


Leu 


lyr 








20 










25 










30 




Gin 


His 


Ala 


Val 


Glu 


Tyr 


Phe 


Leu 


His 


Ala 


He 


Lys 


Tyr 


Glu 


Ala 


His 






35 










40 










45 








Ser 


Asp 


Lys 


Ala 


Lys 


Glu 


Ser 


He 


Arg 


Ala 


Lys 


Cys 


Val 


Gin 


Tyr 


Leu 




50 










55 










60 










Asp 


Arg 


Ala 


Glu 


Lys 


Leu 


Lys 


Asp 


Tyr 


Leu 


Arg 


Ser 


Lys 


Glu 


Lys 


His 


65 










70 










75 










80 


Gly 


Lys 


Lys 


Pro 


Val 


Lys 


Glu 


Asn 


Gin 


Ser 


Glu 


Gly 


Lys 


Gly 


Ser 


Asp 










85 










90 










95 


Ser 


Asp 


Ser 


Glu 


Gly 


Asp 


Asn 


Pro 


Glu 


Lys 


Lys 


Lys 


Leu 


Gin 


Glu 


Gin 








100 










105 










110 






Leu 


Met 


Gly 


Ala 


Val 


Val 


Met 


Glu 


Lys 


Pro 


Asn 


lie 


Arg 


Trp 


Asn 


Asp 






115 










120 










125 






Val 


Ala 


Gly 


Leu 


Glu 


Gly 


Ala 


Lys 


Glu 


Ala 


Leu 


Lys 


Glu 


Ala 


Val 


He 




130 










135 










140 










Leu 


Pro 


He 


Lys 


Phe 


Pro 


His 


Leu 


Phe 


Thr 


Gly 


Lys 


Arg 


Thr 


Pro 


Trp 


145 










150 










155 










160 


Arg 


Gly 


He 


Leu 


Leu 


Phe 


Gly 


Pro 


Pro 


Gly 


Thr 


Gly 


Lys 


Ser 


Tyr 


Leu 










165 










170 










175 




Ala 


Lys 


Ala 


Val 


Ala 


Thr 


Glu 


Ala 


Asn 


Asn 


Ser 


Thr 


Phe 


Phe 


Ser 


Val 








180 










185 










190 






Ser 


Ser 


Ser 


Asp 


Leu 


Met 


Ser 


Lys 


Trp 


Leu 


Gly 


Glu 


Ser 


Glu 


Lys 


Leu 






195 










200 










205 






Val 


Lys 


Asn 


Leu 


Phe 


Glu 


Leu 


Ala 


Arg 


Gin 


His 


Lys 


Pro 


Ser 


He 


He 




210 










215 










220 










Phe 


He 


Asp 


Glu 


Val 


Asp 


Ser 


Leu 


Cys 


Gly 


Ser 


Arg 


Asn 


Glu 


Asn 


Glu 


225 










230 










235 










240 


Ser 


Glu 


Ala 


Ala 


Arg 


Arg 


He 


Lys 


Thr 


Glu 


Phe 


Leu 


Val 


Gin 


Met 


Gin 










245 










250 










255 




Gly 


Val 


Gly 


Asn 


Asn 


Asn 


Asp 


Gly 


Thr 


Leu 


Val 


Leu 


Gly Ala 


Thr 


Asn 








260 










265 










270 






He 


Pro 


Trp 


Val 


Leu 


Asp 


Ser 


Ala 


He 


Arg 


Arg 


Arg 


Phe 


Glu 


Lys 


Arg 



275 280 285 
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lie 


Tvr 
290 


lie 


Pro 


Leu 


Pro 


Glu 
295 


Glu 


Ala 


Ala 


Arg 


Ala 
300 


Gin 


Met 


Phe 


Arg 


Leu 


His 


Leu 


Gly 


Ser 


Thr 


Pro 


His 


Asn 


Leu 


Thr 


Asp 


Ala 


Asn 


He 


His 


305 








310 










315 










320 


Glu 




Ala 


Ar g 


Lys 


Thr 


Glu 


Gly 


Tyr 


Ser 


Gly 


Ala 


Asp 


lie 


Ser 


He 








325 










330 










335 




lie 


Val 




Asp 


Ser 


Leu 


Met 


Gin 


Pro 


Val 


Arg 


Lys 


Val 


Gin 


Ser 


Ala 






34 0 










345 










350 






X I IX. 






T . \/ c 

XJjS 


i-i_y o 


Val 


Cys 


Glv 


Pro 


Ser 


Arg 


Thr 


Asn 


Pro 


Ser 


Met 














360 










365 








Met 


lie 


Asp 


Asp 


Leu 


Leu 


Thr 


Pro 


Cys 


Ser 


Pro 


Gly Asp 


Pro 


Gly Ala 




370 








375 










380 










Met 


Glu 


Met 


Thr 




Met 


Asp 


Val 


Pro 


Gly 


Asp 


Lys 


Leu 


Leu 


Glu 


Pro 


385 








390 










395 










400 


Val 


Val 


Cys 


Met 


Ser 


Asp 


Met 


Leu 


Arg 


Ser 


Leu 


Ala 


Thr 


Thr 


Arg 


Pro 








405 








410 










415 




Thr 


Val 


Asn 


Ala 


Asp 


Asp 


Leu 


Leu 


Lys 


Val 


Lys 


Lys 


Phe 


Ser 


Glu 


Asp 








420 








425 










430 






Phe 


Gly Gin 


Glu 


Ser 




























435 





























<210> 6 
<211> 444 
<212> PRT 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence : /note = 
synthetic construct 



<400> 6 






























Met 


Ser 


Ser 


Thr 


Ser 


Pro 


Asn 


Leu 


Gin 


Lys 


Ala 


He 


Asp 


Leu 


Ala 


Ser 


1 








5 










10 










15 




Lys 


Ala 


Ala 


Gin 


Glu 


Asp 


Lys 


Ala 


Gly Asn 


Tyr 


Glu 


Glu 


Ala 


Leu 


Gin 






20 










25 










30 






Leu 


Tyr 


Gin 


His 


Ala 


Val 


Gin 


Tyr 


Phe 


Leu 


His 


Val 


Val 


Lys 


Tyr 


Glu 




35 










40 










45 








Ala 


Gin 


Gly Asp 


Lys 


Ala 


Lys 


Gin 


Ser 


He 


Arg 


Ala 


Lys 


Cys 


Thr 


Glu 




50 










55 










60 










Tyr 


Leu 


Asp 


Arg 


Ala 


Glu 


Lys 


Leu 


Lys 


Glu 


Tyr . 


Leu Lys 


Asn 


Lys 


Glu 


65 








70 










75 










80 


Lys 


Lys 


Ala 


Gin 


Lys 


Pro 


Val 


Lys 


Glu 


Gly 


Gin 


Pro 


Ser 


Pro 


Ala 


Asp 






85 










90 










95 




Glu 


Lys 


Gly Asn Asp 


Ser 


Asp 


Gly 


Glu 


Gly 


Glu 


Ser 


Asp 


Asp 


Pro 


Glu 






100 










105 










110 






Lys 


Lys 


Lys 


Leu 


Gin 


Asn 


Gin 


Leu 


Gin 


Gly Ala 


He 


Val 


He 


Glu 


Arg 


115 










120 










125 








Pro 


Asn 


Val 


Lys 


Trp 


Ser 


Asp 


Val 


Ala 


Gly 


Leu 


Glu 


Gly 


Ala 


Lys 


Glu 




130 








135 










140 




His 






Ala 


Leu 


Lys 


Glu 


Ala 


Val 


He 


Leu 


Pro 


He 


Lys 


Phe 


Pro 


Leu 


Phe 


145 








150 










155 










160 


Thr 


Gly 


Lys 


Arg 


Thr 


Pro 


Trp 


Arg 


Gly 


He 


Leu 


Leu 


Phe 


Gly 


Pro 


Pro 






165 










170 










175 




Gly 


Thr 


Gly 


Lys 


Ser 


Tyr 


Leu 


Ala 


Lys 


Ala 


Val 


Ala 


Thr 


Glu 


Ala 


Asn 




180 










185 










190 






Asn 


Ser 


Thr 


Phe 


Phe 


Ser 


He 


Ser 


Ser 


Ser 


Asp 


Leu 


Val 


Ser 


Lys 


Trp 




195 










200 










205 








Leu 


Gly 
91 n 


Glu 


Ser 


Glu 


Lys 


Leu 
215 


Val 


Lys 


Asn 


Leu 


Phe 
220 


Gin 


Leu 


Ala 


Arg 
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Glu Asn Lys Pro Ser lie He Phe He Asp Glu He Asp Ser Leu Cys 
225 230 235 240 

Gly ser Arg Ser Glu Asn Glu Ser Glu Ala Ala Arq Arg He Lys Thr 

245 250 255 

Glu Phe Leu Val Gin Met Gin Gly Val Gly Val Asp Asn Asp Gly He 

260 265 270 

Leu Val Leu Gly Ala Thr Asn He Pro Trp Val Leu Asp Ser Ala He 

275 280 • 285 

Arg Arg Arg Phe Glu Lys Arg He Tyr He Pro Leu Pro Glu Pro His 

290 295 300 

Ala Arg Ala Ala Met Phe Lys Leu His Leu Gly Thr Thr Gin Asn Ser 
• T 305 u 310 315 320 

Leu Thr Glu Ala Asp Phe Arg Glu Leu Gly Arg Lys Thr Asp Glv Tvr 

325 330 335 Y 

Ser Gly Ala Asp He Ser He He Val Arg Asp Ala Leu Met Gin Pro 

340 345 350 

Val Arg Lys Val Gin Ser Ala Thr His Phe Lys Lys Val Arg Glv Pro 

355 360 3 6 5 

Ser Arg Ala Asp Pro Asn His Leu Val Asp Asp Leu Leu Thr Pro Cvs 

370 375 380 

Ser Pro Gly Asp Pro Gly Ala He Glu Met Thr Trp Met Asp Val Pro 
385 390 395 J 400 

Gly Asp Lys Leu Leu Glu Pro Val Val Ser Met Ser Asp Met Leu Arg 

405 410 415 

Ser Leu Ser Asn Thr Lys Pro Thr Val Asn Glu His Asp Leu Leu Lys 

420 425 430 

Leu Lys Lys Phe Thr Glu Asp Phe Gly Gin Glu Gly 
435 440 

<210> 7 
<211> 437 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : /note = 
synthetic construct 



<400> 7 






























Met 


Thr 


Thr 


Ser 


Thr 


Leu 


Gin 


Lys 


Ala 


He 


Asp 


Leu 


Val 


Thr 


Lys 


Ala 


1 








5 










10 










15 




Thr 


Glu 


Glu 


Asp 


Lys 


Ala 


Lys 


Asn 


Tyr 


Glu 


Glu 


Ala 


Leu 


Arg 


Leu 


Tyr 




His 




20 










25 










30 




Gin 


Ala 


Val 


Glu 


Tyr 


Phe 


Leu 


His 


Ala 


He 


Lys 


Tyr 


Glu 


Ala 


His 






35 










40 








45 








Ser 


Asp 


Lys 


Ala 


Lys 


Glu 


Ser 


He 


Arg 


Ala 


Lys 


Cys 


Met 


Gin 


Tyr 


Leu 




50 










55 










60 








Asp 


Arg 


Ala 


Glu 


Lys 


Leu 


Lys 


Asp Tyr 


Leu Arg 


Asn 


Lys 


Glu 


Lys 


His 


65 










70 










75 






80 


Gly 


Lys 


Lys 


Pro 


Val 


Lys 


Glu 


Asn 


Gin 


Ser 


Glu 


Gly 


Lys 


Gly 


Ser 


Asp 










85 










90 










95 


Ser 


Asp 


Ser 


Glu 


Gly 


Asp 


Asn 


Pro 


Glu 


Lys 


Lys 


Lys 


Leu 


Gin 


Glu 


Gin 








100 










105 










110 






Leu 


Met 


Gly 


Ala 


Val 


Val 


Met 


Glu 


Lys 


Pro 


Asn 


He 


Arg 


Trp 


Asn 


Asp 






115 










120 










125 




Val 


Ala 


Gly Leu 


Glu 


Gly 


Ala 


Lys 


Glu 


Ala 


Leu 


Lys 


Glu 


Ala 


Val 


He 




130 










135 










140 








Leu 


Pro 


He 


Lys 


Phe 


Pro 


His 


Leu 


Phe 


Thr 


Gly 


Lys 


Arg 


Thr 


Pro 


Trp 


145 










150 










155 








160 
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Arg Gly lie Leu 

Ala Lys Ala Val 
180 

Ser Ser Ser Asp 
195 

Val Lys Asn Leu 
210 

Phe lie Asp Glu 
225 

Ser Glu Ala Ala 

Gly Val Gly Asn 
260 

lie Pro Trp Val 
275 

lie Tyr lie Pro 
290 

Leu His Leu Gly 
305 

Glu Leu Ala Arg 

lie Val Arg Asp 
340 

Thr His Phe Lys 
355 

Met lie Asp Asp 
370 

lie Glu Met Thr 
385 

Val Val Cys Met 

Thr Val Asn Ala 
420 

Phe Gly Gin Glu 
435 



Leu Phe Gly Pro 
165 

Ala Thr Glu Ala 

Leu Met Ser Lys 
200 

Phe Glu Leu Ala 
215 

Val Asp Ser Leu 
230 

Arg Arg lie Lys 
245 

Asn Asn Asp Gly 

Leu Asp Ser Ala 
280 

Leu Pro Glu Glu 
295 

Ser Thr Pro His 
310 

Lys Thr Glu Gly 
325 

Ser Leu Met Gin 

Lys Val Cys Gly 
360 

Leu Leu Thr Pro 
375 

Trp Met Asp Val 
390 

Ser Asp Met Leu 
405 

Asp Asp Leu Leu 
Ser 



Pro Gly Thr Gly 
170 

Asn Asn Ser Thr 
185 

Trp Leu Gly Glu 

Arg Gin His Lys 
220 

Cys Gly Ser Arg 
235 

Thr Glu Phe Leu 
250 

Thr Leu Val Leu 
265 

lie Arg Arg Arg 

Ala Ala Arg Ala 
300 

Asn Leu Thr Asp 
315 

Tyr Ser Gly Ala 
330 

Pro Val Arg Lys 
345 

Pro Ser Arg Thr 

Cys Ser Pro Gly 
380 

Pro Gly Asp Lys 
395 

Arg Ser Leu Ala 
410 

Lys Val Lys Lys 
425 



Lys Ser Tyr Leu 
175 

Phe Phe Ser Val 
190 

Ser Glu Lys Leu 
205 

Pro Ser lie lie 

Asn Glu Asn Glu 
240 

Val Gin Met Gin 
255 

Gly Ala Thr Asn 
270 

Phe. Glu Lys Arg 
285 

Gin Met Phe Arg 

Ala Asn lie His 
320 

Asp lie Ser lie 
335 

Val Gin Ser Ala 
350 

Asn Pro Ser Val 
365 

Asp Pro Gly Ala 

Leu Leu Glu Pro 
400 

Thr Thr Arg Pro 
415 

Phe Ser Glu Asp 
430 



<210> 8 
<211> 37620 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : /note = 
synthetic construct 



<400> 8 

ggatccaatc 

tgacagttcc 

tattttgtag 

attgaggtta 

ggttccatca 

gtttgttttt 

ctcactgcaa 

ctggagttac 

gagtttcgct 

ctgcctccta 

tacaggaacg 

ccatgttggc 



cagaatccca 
tcagtctttc 
aagtagaaaa 
caggtttggg 
taacaacatg 
accgattctc 
tctctgcctc 
atgcaggcac 
ctgtagccca 
ggtcccggtt 
caccaccatg 
caggcgggtc 



tactgcattt 
cttgtctttt 
accctccata 
aagcatacca 
gcttattact 
actctgttgc 
ctgggttcta 
catcacgcct 
ggctggagtg 
caagaaattc 
cccagctaat 
ttgaactgct 



agttgtcatc 
atgatcttga 
tgaatttttc 
cagagctaat 
ggtgataaaa 
ccaggctgaa 
gtgattctcc 
ggctaatttt 
ctgtggtgcg 
tcctgcctca 
ttttgtattt 
gacctcaggt 



ttcttagtct 
catttttgag 
tgatgttttc 
gtgcccttct 
gaggggtttt 
gtgcggtggt 
tgcttcaccc 
tgtttgtttg 
atctcagctc 
gcctcccgag 
ttagtagaga 
gatcgcccgc 



ctacaatctg 
gagcactggt 
tcatgattag 
ccacatcagg 
ttgtttgttt 
gtgatcttgg 
tcccaagtag 
ttttgagaca 
accacaacct 
tagctgggat 
cagggtttca 
cttggcctcc 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
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caaagtgctg ggattacagg cgtgactcac cacacccggc ctaatttttg tatttttagt 780 

agagatggag tttcaccatg ttggccaggc tggtctcgaa ctcctgacct caagtgatcc 840 

acccacctcg gcctcccaaa gtgctgggat tacaggcgtg agtcaccacg cccggcccat 900 

ctttgcttaa attataaact cagtttctcc cacagttagt tcagcctatg ctcaggaatg 960 

tacaaggaca gcttggaggt taaaagtaag atggagttgg ttaagttaga tctctttcac 1020 

tgtctcagtc atcattttgc gaaggcaaat tcacaagtgg gaagaaagga cgggtagatg 1080 

cagggcagag agagggctct gtgcttcgag cctggatgat acagatttgg caatactatt 1140 

ctaaggagag accagagaaa gaagtttagg gtagaaggta attaattaga gtatgttgaa 1200 

tttgtcttgc agaaaaaccc tgtgaaggaa acctctggga ggctgttgta aatgctgggc 1260 

tggggatggg gtgggtctag agatccagat ttgaagcaat ttctagtcta ttgatagctg 1320 

aagctcaaga agagagttag atgagcagag gtgaagagag aagaggacat cagactcttg 1380 

agtaaattca gcttaagctc agggaggagt aagaaaggga gacagtgaag gaagcagata 1440 

aggaatattg ggaagctgga caaacaagaa ctgggtagtt gacaatacca atgaagaggg 1500 

aagaggagag gactttaatt ctttcggaga tgccagatga gaaaaaatta agaaatgagt 1560 

gagaaaaaaa tgatggagtg attaaacatc tatgttgctt tcctttgaga atgttaacta 1620 

tgaaaagaag aaaagggtgg tggtggctgc aaaccatagg aaggctgaga aagggtttta 1680 

ctttggttct gcctctaatc cctgacgcat tcaacctcct caagccattt gcttgtaaac 1740 

ataactggca taaataatat ctgatatgcc cattttacag gacaagtgct tccctctcag 1800 

accacatact cgaggacagc ggccgtccac gccttattta cattcatctt cccaacacaa 1860 

cagtcatttg gaagatgccg agtggcgtgg gaagaacaag cgggaacgag cgccgacccg 1920 

cccgcggcct gactgctagc gcaccgctgt gcggccgcct caccgcgcac tccctcctga 1980 

attattcacc accgagacga ggacggtgat cgcccccatt ccacagagga cacggagtga 2040 

aggcgcagtc acaggcccaa agcccccagt cggcgggtct tggaaatgga cgactttttc 2100 

actcaaaagc tacgaggcaa caaagcatgt ggatgaggag ggaagggagc cggggccaat 2160 

aattcatctc aaagtaaaag acgtccccgg gggtggggaa caaacaaaca acaaaacccg 2220 

gctctgtgct tccgcgcttg cgctctccta ggtctgccag gctgccgccc gcgcctaggc 2280 

cagagtctca gtgcgcctgc gccgagcccg acttccgcca gggctcgatt ggcttccagg 234 0 

ctacttctcc cggcccaggg agggcctgcg ctgtggctag agaagggagt ctggtgactg 2400 

ggtgcgcgtg ctgatgacga aatcggaagt gcccggagca gaggcgggac cagaacctag 24 60 

agggcctcgg gattgcggaa gtttggtggg gagggtcgga gctctggtgg agagagtgtt 2520 

gtctaaaaca agttccggaa gggaggctgc ccttcgcggt ccgagaacca ccggcctccc 2580 

cagtttgagg gctgttaccc cgtgcgcgct tcgacgttgc tgctgttggc tctcctcgcc 2640 

cctcgttccc ttgggaaccg cctgggaact ccgccatgtc atccacttcg cccaacctcc 2700 

aggtatcatt gctcatctcc cgtcccccta gggagctggg atcgagggga ggcgcgagaa 27 60 

gcgggtgggg ggcggggaga agggaggaag cggccgcgcc tgtggtcgag gcgcaggagg 2820 

gcgggtccgg ggacctgcag caggtgggag aggggcccgt tttccaacgg aaaggtcgtt 28 8 0 

tctggcccgc tcttggggtc tcccgagcta ctggctttgt gagactcttc acttttgttg 2940 

acacttcccg cttccaaacc agccaagatc tgggcaggtt tctccaatca gggttaacgg 3000 

agaaagaaag aacgtaattt ttgcaaagcc ctattctgga ccagttgaag atagggtggt 3060 

catagttcgg tcttagtttc tcaagctgac agcagacccc ttcattctga gctgtcaggt 3120 

gagtagagat atgggaacag accaagaatc tgagattaca gccagcctgc atgcaatctc 3180 

cgtgaagtca ttgaatttat gtgcacactc ctgtggctcc cagatggtag tgtgtaagat 324 0 

cgaggagtat tcttaaacct ctgtgtggct ccacttcccc tcctctaccg caatgagcac 3300 

agtgttttgt acacaaatca tggaatctta agattttagg cttaatcgtt agtcctctgt 3360 

ggaatggaat tcgttcagaa aagaaatttc ttaatgtaat ggtgtctccc atccccagaa 3420 

tggatctagg gcttcagtgg cgtttagaat gtaatgtggc ttaatgtaga ggactcccag 3480 

tccagtaagt caataaggca caaaagagta gcctaaaata cttgtattga gtccccgtcg 3540 

tggacttccg gtagtttgtt gaacaagcag ttattggagt gcctgacaga ggcatcaaag 3600 

gtaatggaag aactccttaa cccatgataa ttgattctga agaaagacca ctcaaagtag 3660 

acccacttaa acagagatta aaatttcaca cacacataca caaaaagagt gggaaaggag 3720 

gcttgcttgg gactttcatc ttgaaaatcg acctagaaga ttattaaaat aattgttatc 3780 

tttcacaaaa tattataatt gtataattgc aacaatgact cattttgtgg aataagtcaa 3840 

gtgttgattt gtctgtgttt ttttcctttt cttttctttc tttttttttt tttttttgag 3900 

acggagtctt gcacggtcgc ccaggcttga gtgcagtggc gtgatttcag ctcactacaa 3960 

cctccacctc ccaggttcaa gcgattctct tgcctcagcc tcctgagtac ctgggactac 4020 

aggcacccac caccacgccc agctaatttt tgtattttta gtagtagaga tgaggtttca 4080 

ccatgttggc ccaggctggc ctcttgacct caggtgatcc gcctacctca gcctcccaaa 4140 

gtgctgggat tacaggcgtg agtcaccgtg cccggatgac tatctttttc accaaggttg 4200 

tattaaacta agattcagca atcttcactt gagttttttg tagctttttg tagtcctatt 4260 
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tggaaaggtt 

ttttcccttg 

ttaacaattt 

ctctacaaca 

gctctgttgc 

ccaggttcaa 

caccacgcct 

cgctggtctt 

gttacaggtg 

ggctgttctt 

taagcaggaa 

cattcaacct 

acttcctgct 

tgcagataac 

ccctgtggtt 

taaatgaaat 

tgtcattcac 

aaggttgaga 

ttacagtata 

ggtaaggggc 

agaaaaattg 

gacagagtct 

acctccatct 

ttacaggcat 

gccaggccag 

gtagaattac 

gaggaagagt 

gtagggattg 

aataatttgg 

gtccggtgaa 

aaggatttat 

agagtttaca 

ttaaggagag 

ggcgcagtgg 

cagcttagga 

agccaggtgt 

tggcttgagc 

ctgggtgaca 

gtgtcatgac 

gcggagaggg 

gggagaaata 

ggaggcaaag 

tgtgtagttt 

ggaaaaaagg 

agcagaggaa 

gggcctggga 

ttctaaggtg 

cctttatact 

agtcttgtgt 

gaactgctct 

aaaacaaaaa 

ttaagtcact 

tgaatatact 

agccagaatg 

gtccaagaag 

ggtgtttgga 

gaaagagaat 

aactaacacc 

cgatggaaat 



ctttttggta 

agtattaaag 

ttattccacc 

taatcttcat 

ccagactgga 

gtgattcttg 

ggctaatttt 

gaactcctga 

tgagccaccg 

cagtgaatca 

ggatcttgaa 

tttcaacctt 

gaacaaggca 

taaaaatcaa 

ttactttcca 

attctagaat 

ttcagttcat 

acagtgcaat 

ttgtaataaa 

caaattagta 

aagatggaga 

cactctattg 

cccaggttca 

gcaccaacac 

tctcaaagtc 

aggcgtgagc 

ctgagttttc 

gttctgggaa 

ttctaaactt 

tatttgcata 

tagcgtacag 

gaataaggca 

agaaatggga 

ctcatgcctg 

cttcgagacc 

ggtggtgcat 

ccaggaggtg 

cagtgagacc 

agccaggcaa 

agtgagataa 

ggagatcagg 

agatgagtga 

gtgttctgca 

aaaggttgga 

ggtggggatc 

cacctcttct 

gatgtgtctg 

tccgtatgga 

cagccactgt 

tctcctggag 

aaacatatgc. 

gatgtagggg 

tcattgtcac 

gtgggaggaa 

gaggaaataa 

tctagcatgc 

agaggccact 

gaacagtgtt 

atctagtgga 



aacgtatttg 

gtgactcttc 

agcttttcat 

catttgcagg 

gtgcagtggc 

tgcctcagcc 

ttctattttt 

cctcaagtga 

.tgcccggccc 

gccagagact 

ttgtaatttt 

ctaacctagg 

aaaatctctg 

aattatataa 

cagtttcaat 

aaatagttca 

atcatgacgt 

aagatgttga 

atgtactggg 

gagtatccat 

agagtcaagc 

cccaggcagg 

agcgattccc 

acccagctaa 

ctgaactcag 

cactgctccc 

tcccagattt 

tacaaaagga 

gtcgagtttg 

catatgtgta 

gtggtagcag 

aaacaaagac 

gcctggaaaa 

taatcccagt 

agtctgggca 

gcctgtagtc 

gaggtaacac 

ctgtctaaaa 

ggagacactg 

gcatatgatt 

aaactgggat 

gatgtaggga 

gatgtgagtc 

gagaacagtt 

agctccacag 

gaggtaggta 

gtactcctaa 

taactggatc 

gctagactct 

cttacattct 

atgctcaga.c 

gggaggaaac 

ataagagagg 

aaccagcaaa 

tcaactttgt 

tagagagcct 

ctggtgagat 

ttgtcaaata 

gagaaattgt 



gaaaaagcgt 

tgaagtgccc 

ttgtcagaga 

tttttgttgt 

gtgatcttgg 

tcccgagtag 

agtagagaca 

tccgcccgcc 

cagaatgcat 

gaccaacatg 

gttagtggta 

aaatttggat 

tggccagatg 

tgtaataaag 

aacccgaagt 

taattttgaa 

aggcatttta 

gagagagaaa 

caaggaggca 

tcaaatgaag 

ttgtatctgc 

agtgcagtgg 

ctgcatcagc 

tttttgtaga 

gtgatccacc 

tgctgtatct 

ctgggttggg 

gagctatttg 

agatgcttgt 

gaagtaggga 

aggttgtagg 

cacgttagaa 

gaattatgaa 

gttttgggag 

acatggtgaa 

ccagctactc 

tgaggcgtga 

aaaaaaaaat 

tcagggtatt 

tggctatcag 

gaacatgggt 

gaggagaacc 

tgagatagga 

gataggcatg 

cagaggagaa 

attgtagaat 

gtggaacagg 

ctgactgatc 

gggaatataa 

aatagaagag 

acacacatgc 

agagagaaat 

aagaatcaat 

gtggtggttg 

aaaatgtcag 

tgatatgagc 

ttttggctct 

agacttttta 

taagaatgac 



tggtcatttg 

atcttgtcat 

ctttgcatgt 

tgttgttttg 

cgcactgcaa 

ctgggattac 

gggtttcacc 

ttagcctccc 

catacttgaa 

aaaggcataa 

ggttcacatg 

taatattcat 

cttataatct 

ctgcacagta 

taacctgggt 

acactatgtc 

ttatttcaca 

ccacattcac 

aggtctgtgg 

tggaaaggat 

atttattttt 

catgatctcg 

cccccatgta 

gatggggatt 

cccgttggcc 

gcattttaaa 

tatctgagta 

gggccagggc 

ggagacaatc 

gagtggacta 

catacagagt 

attctatgga 

tcaaaagtaa 

gccgaggcag 

aacccctctc 

cagaggctga 

tcacacccgt 

actaaaacta 

tgttgtgtgt 

gaggcagaaa 

attctttggg 

gaggagctga 

caacacattt 

tcattgatgg 

agaattggtc 

ttaccccatt 

acaattccac 

ctggggtgac 

agaagtgaag 

agaggccaat 

acacacaaag 

gagtcccagc 

gaaggacgct 

ggggctcagc 

aagatcaaga 

cgttttatta 

aaaaggtagc 

aaggagggct 

agaagtgaag 



gtaaaagtat 

cgttggcttt 

gatttcgata 

agacagtctt 

cctccacctc 

aggtgcctgc 

atgttggcca 

aaagtgctgg 

ttggactgtt 

ttgttactct 

tgtgctttct 

gtagtgatta 

aatgagggaa 

ttctccactt 

ccaaaaattt 

acaatgcctg 

taatcacaag 

atcactttta 

gatgggaggc 

tgatcaaagg 

ttttttttga 

gcttatggta 

gcagctggga 

cgctgtgttg 

tcccaaagtg 

agtgagggag 

cccaactcag 

aagggagcaa 

aaatggaaat 

ggctggaaat 

cacatcctgg 

ttagcgacag 

aaccaggcca 

cagatcgctt 

tacaaaaatt 

ggtgggagaa 

gtactccagc 

gaagaaaatg 

gttaaatgct 

gcattgctgt 

attttgctgt 

ggagtgatgg 

ttctgctgag 

agtaaatgtc 

tcaacagaca 

tctcagggtt 

atgttcccaa 

acagtcattc 

acaacagaca 

aaattaaaaa 

atcatcagcc 

ccacacctga 

gagaaggagg 

agaggacagt 

agacagagct 

tgtgatacag 

agagaaacag 

gtttgtatgt 

taaccaaaag 



4320 
4380 
4440 
4500 
4560 
4620 
4680 
4740 
4800 
4860 
4920 
4980 
5040 
5100 
5160 
5220 
5280 
5340 
5400 
5460 
5520 
5580 
5640 
5700 
5760 
5820 
5880 
5940 
6000 
6060 
6120 
6180 
6240 
6300 
6360 
6420 
6480 
6540 
6600 
6660 
6720 
6780 
6840 
6900 
6960 
7020 
7080 
7140 
7200 
7260 
7320 
7380 
7440 
7500 
7560 
7620 
7680 
7740 
7800 
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gaagaagata ctgaaaaagg agtagggaat tttttttttc attcgttaaa caatttcgag 7860 

tacctgttgt atgccggcac cattctaggc atgggtagct caaggtgaag tcctgtcctc 7920 

atggaaccta cctccctaaa ggagacagcc agtgggtgag catacaggta aatacaatga 7 980 

cagtgatggg atgagagagt gcagtggggg cagatagcaa agagaaggaa tgaggtgcag 8040 

cctttgctat agcaaattag ggaggcctct ctaaggactt gccatctgaa ctaggactga 8100 

aaaggagaca tcatgcatct gtctggggaa aggagggaag agcattctag gtggaagggt 8160 

tcaactgctt gggctgcctg agtgcaggag gattctaggg ttcggggaga accacacgta 8220 

gctagggccc aggttatttc agaccttgta gatcagctct agcatcagga ttttattgtt 8280 

tttaatggga gaccatcaag gttacctttt atgttttgag gtagcaaggg tgggaacaga 8340 

ctgaccactc acgaggctgg gtagtagttt tgacatatat gatagcagct tgaactaggg 8400 

tgttagcaag ggagatggtg aggattatat tcagaaagaa aattagctct ggctgaaaag 8460 

ggaagataat gtttgatgga ccagagcaga tggagctgga agaggggaag acaggcaaca 8520 

gactctgcca gtattatctg tctgagactg aagaactcca gtcagtagtg aatgataaca 8580 

gccaggacct ccagaggctg caggcataaa tgaatgaact caatgcaaaa gttcacctat 8640 

tgcaggagga gctacagctg ctgcaggaac agggctcctg tataagggaa gtagtccggg 8700 

ccatgaataa ggagaaagtg ttagtcaagg tgtatcccaa gggcaagttt gtcttagatg 8760 

tggacagaaa cagcaacatc aatgacatga gacccaattg ccaggggggc gctaagaaat 8820 

gatagcgaca cactgcacaa tatcataccc agcaaagtag acccactggt gtcaccagtg 8880 

atggtggaga aggtgccaac cttgatttaa gagatgattg gtggactgga caagcagatt 8940 

aaggacatca cagaagtgat cgagcagcct gttaagcatt ccaagctctt tgaagcactg 9000 

ggtatcacac aacccagggt ggtactgctg tacggacctc taggcactgg gaagacactg 9060 

tttgcctagg ctgtggctca ccatacagac tgtacctttg ttcatatccc tggctctgaa 9120 

tttgtataga aattcatcgg ggaagggcca agaatggtga gggagctgtt tgtcatggca 9180 

tgaggacctg ccatctgtca tcagctccat cagctcctcg cagctggagg ggggttctgg 9240 

aggggacagt gaagtgctgc acatcgtgct ggaactgctc aaccaactgg acagctttga 9300 

ggcccccaag aagatcaagg ttatcatgga ccactgcttc actcagggca catcaacaga 9360 

aaaattgaat tccagctgga cattttgaag atccattctc tgaaaaggaa cctgacccag 9420 

gggatcaacc agagaaaaat tgccaagctt atgccaggag catcaggggc tgaagtgaaa 9480 

gtcatataca gagaaagcag catatatgcc ctgagggaac ggcgagtgca tgtcacccag 9540 

gaggactttg agatagcagt agccagcgtc atgcagaagg atagagagta aaacatgtcc 9600 

atcaagaaac tatggaagtg cggggatgtc ctttgtatgg atccctccgg taaagctttc 9660 

tggaatgaaa aaataaaata attagctctc cagaattaac catagaagac atggttaatt 9720 

ttatttcaaa tgaaaaatgc taacagacat ggttaatttt atttcagatg aaaaatgtta 9780 

acagaaataa ggttgaaatg gtattcatca gcagaaaaaa aacagatgca tatcctctta 9840 

attcatcgtc ttcgattaaa ggtttaacgt gtgccaccca gtacagtggc tgcatgtggc 9900 

tattttaatt acataaactc actcctcagt cacactagcc acatttcaag tgctcagaag 9960 

ccatatacag acattctcat catcagggaa aggctggttt aattagcact atgtttggag 10020 

ctagatgctt aaataatacc cactgcagta taagagattt aacttaataa aaatcagtgt 10080 

ggtgcctcag tgaggcctga ctgaatgaga ggaacaagtg gctgaagagg tgagtggcac 1014 0 

agcatttcaa aatcacctga gaagaaaatt atagttagga taagaaagag cttatttcta 10200 

ggccaggcac ggtggctcac gcctgttatc ccagcacttt gggaggctga ggcaggtgta 10260 

tcacctgagg tcaggagttt gagaccagcc tggccaacat ggtgaaatcc catctctact 10320 

aaaaatacaa aaattagcca ggcatggtgg tgggtgcctg taatcccagc cactggggag 10380 

gctgaggcag gagaatcaca tgaacctggg aggcagagat tgcagtgagc caagatggcg 104 4 0 

ccattgcact ccagtctggg cgacaagagt gagactccat ctcaaaagaa aaaagaaaaa 10500 

ggatttattt ctgctatgtg gcttaaccac tgaggctttg agatgcattt ttttttttct 10560 

tttttgagac agagtcttgc tctgtcaccc aggctagagt gcagtggcat gatctcggct 10620 

cactgcaacc tctgtctccc ggattcaagc gattctcctg actcagcctc ccaagtagct 10680 

gggattacag gcgcctggct aatatttttt gtatttttag agatggggtt tcattatatt 10740 

ggccaggctg gtctcgaact cctgacctca agtgatccgc cctcctcggc ctcccatact 10800 

gctgggatta caggcgtgag ccactgtgct tggcctcact ctgtcaccca ggctggagtg 108 60 

cagtggcaca atcttggctg tggtgcattt ttatatgacc agtctgacct aaccaccact 10920 

tgactgtgtg actccatgtt ttatgttcat atactagctg ataaaacttg aaaatttagc 10980 

tttgaattta ggctacttga cctgtgagat acgtaactat tacattttat caattctgaa 11040 

gtaaaccatt tatttttctt ccctgctaca atagaggaag catggtcttc cagcgggaga 11100 

cctgttcctc ccatttacag atctcagccc agttatatcc aatccatgct caactgcccc 11160 

aaaatgtaat attcttacac ataattggtt tttgcattgt ttgaatgatt ttttgcttat 11220 

gttatatact tactttttaa tatctctaac aactgtcaat tttcattttt ttttactggc 11280 

acctatttct gttgaatagt tttactacct aaattaaccc agttggaatg ttttaataag 11340 
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tcatgtttgc cctttatttg 
tggagtctca ctctgttgcc 
caccgcctcc cgggttcaag 
ggcgcatgcc gctacgcctg 
gttgcccacg ctggtcgcaa 
gattcacacg cagttgtaag 
cagtggcagc atcttgtaga 
ttccaccact cttactcata 
tgggtgtggt tgtttgtgca 
tttatgtatt ctcgatacta 
tatagattgt cctttcatcc 
gatccagtta cctttttttt 
ctgcgcatcc ccagatccta 
tggtgtggac ttgtgatcct 
aggagtttga ggctagcttg 
aagaatcaaa gttttgtagt 
gtaggaagta ttaagtttaa 
cctctagcac cacttcttaa 
tttcaaaaat ctcttggctt 
ccacaaaagt agtccatgtt 
tgaaaaatat atgttttatc 
cctgtgctgc ttttttctgc 
gtcctgcttt ttaaaaatgc 
atgttatttt agtgatcctt 
caggttttgt ttttgttatt 
ttagttaatt actggattga 
ccattgcttc tgaagaggtt 
ttaaagcagc atttaatttt 
attctattat ttcttattag 
tgttagcgat tggtactaaa 
catcttttca acttgtaatt 
ttagttactg tttatttttt 
gctttaaaaa aaaaaaaaac 
tgatttaaat gttgttattt 
gcagcgcaag aagacaaggc 
gtgcagtatt ttcttcatgt 
acgaaaagga aaatttattc 
tctaaacgat ccatttgtta 
gtattgagat gtagctagca 
catcttaaaa tgtagtctag 
tacagatatg tattgcgtgt 
ggttaagacc cctgccctca 
cttttgagct tctcccattt 
cataacctgt aagctatgtg 
ttgaaaacag ctctgacaga 
gaatatatgg gatttccatc 
gaaatgatta atactgcttt 
aagcaacttt gaactatctg 
aataatttcc tgttagttaa 
ctgattcaga cccactgttg 
taatcattgg ctatgatgag 
cttgtctgaa aaaatacata 
aaaaatctct ctcatttccc 
atcagggcaa agtgtacaga 
aataaagaga aaaaagcaca 
aaggg<? ta cg tgtttaaaat 
atttcttttg attaaaacag 
tgcctgtttt tggaataagc 
tgtctagggc tgctttcgtg 



tatgtatctg 
caggctggag 
caattctcct 
gtcaattttt 
actcctgagc 
aaatactgag 
actgggacaa 
ttttcccagt 
gttttatcac 
gttctttgtt 
ttttaacaaa 
tcttttatat' 
agttttccat 
gctatttggg 
gtcgacatag 
tttacattta 
gtcagaattc 
aaagcctgac 
tattagtatg 
ccttatggac 
aaagataaac 
tgttgtttca 
ccattattaa 
tggtgttctg 
taaaagacct 
aatatatgag 
tcaccccatt 
tgagaatttc 
ccctaaattt 
gtaacattca 
tataataaaa 
acttattttt 
tttcagtatg 
tgcctccctc 
tgggaactac 
cgttaaatgt 
tgtttcattt 
aaaaaaaaaa 
aatgtgacag 
tcagtggaaa 
gtaccacgtg 
tagctcttaa 
agttctgagg 
gggctcatcc 
ggcctttgtg 
actttactct 
ccttctattt 
caaaaatgta 
cattaactta 
gtctccaacc 
ttagagatga 
tttaagagtt 
tttaatagat 
atatcttgat 
gaagccagtg 
tgttcaattt 
gtgtcagcaa 
tttatcagaa 
ccgtaacaac 



tgcataattt 
tgcagtggca 
gcctcagtct 
tgtgttttag 
tcaggagaat 
aatccagcat 
tatcacaacc 
tttacttgtc 
atgtagatat 
agatatgttt 
atcttccaga 
agtatgcttt 
gttttttcta 
gaggctgatg 
tgagatggag 
attctatgat 
ttttaggtgt 
tcggtgccat 
gatttatttc 
attttttaga 
tctgtttaac 
aataaattcc 
cactttccat 
ttttgtggca 
ctctgataat 
tatttttcag 
tacatttcca 
tctaaatgct 
aactttagct 
gtagcattta 
tacagttgtg 
gttttctttc 
attatttact 
aaacagaaag 
gaagaagccc 
gaggccatag 
ctaaaaaatt 
atactccact 
ccatcatagt 
ataggttagg 
gcaggcactg 
atggaatgca 
ttttatgtta 
catctattgc 
tagtaaaatt 
tctgaaataa 
gtctttttta 
agggaaaaat 
atataccctg 
ctgctggagg 
tcttccctcc 
caaacctttc 
gaagcacagg 
agagcagaaa 
aaagaaggac 
tattgtggct 
actatggctc 
cacagccgtg 
aaagttgtgt 



tttttttttt 
tgatcttggc 
cccaagtagc 
tagagacggg 
tttagtttga 
tccctttatc 
aggatattga 
cttatgtgtt 
gtgttacatt 
tgcaaatatt 
gagcaaaagt 
tgatgtcaaa 
aaagttttag 
caggatgatc 
ccccgtctca 
ccatttttga 
atgcttatgg 
tgactggctt 
tgggactttt 
agatgtgaaa 
aatttccttc 
tgctacatat 
attattacaa 
tcctttatga 
atctgtgctc 
attcttgata 
ctattgttat 
tgtttagttt 
aagtgaaaat 
ttcctataac 
agttagaaaa 
acagcccagc 
tcaaatttat 
cgatagatct 
ttcagctcta 
gttatatttt 
gaaatataat 
ttctttctag 
tattactttg 
agacattcta 
gggatacaat 
gttagatgat 
aagtgagact 
acgtgttgcc 
tacgatgttg 
agcttttgta 
taattgggtt 
atgaaagctt 
ctgtcttgct 
aaggggtaga 
ttcttccaac 
tacatagaat 
gtgataaagc 
aactaaagga 
agccgagtcc 
ttttgctttc 
atgggccaaa 
cccattcttt 
ggttgtgtgg 



ttttttgagg 
tcactgcaac 
tgggactata 
gtttcactgt 
gataattgta 
cagttgtctg 
tattgataca 
tttatgtgtg 
ttgagagttc 
tctcccagtc 
tttgtcataa 
tctaagaact 
ctgggtgcgg 
acctgagccc 
acaaataaat 
gttaattttt 
atgtctagtt 
tttgcatcat 
taaaaaaatg 
aagtagaaaa 
ccatcttttt 
aaaaagttgt 
atatttctta 
tttaagtttc 
agtaaacctt 
tttcattgcc 
atcttttaga 
gaattttcac 
tacatgtaaa 
atgtaatttg 
aaatgatttt 
aagctgagaa 
gtttatgctt 
ggctagcaaa 
tcagcatgct 
atttaaaaac 
gccagacaat 
tgatttttaa 
tatatgtaaa 
tttgtgtctt 
ggaaagaaat 
tgaagatgtt 
aagaatgagt 
aaagttcact 
gtattacaca 
gacattcata 
tgtctaagac 
cagaatatta 
taggttgttt 
aaagaggtga 
atagaatttc 
actaatggaa 
caagcaaagt 
gtacctgaaa 
agcagatgag 
gatttgtgaa 
gctgcccagc 
tacatattgt 
agaccgtatg 
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gcccacgaaa 

ctagtttgcc 

taggaggaga 

ggaagcttaa 

aaaagtaact 

aatttgtgtt 

aaacgttgaa 

cagaatttgt 

catattctaa 

cacaccctta 

cagcgagatt 

gcaggttatt 

ttggggcttc 

gtgttgactt 

tcaaggtttt 

cctaagaaca 

ctgatcttac 

ggtataaaac 

gactgcccag 

ttgtccatag 

ttacctgaca 

aggacccact 

atagtggggc 

tcccactgtt 

ctctgcgcag 

gcagccccca 

tgagaaacgg 

tggacagcat 

ctcacccaca 

gtcatgcttt 

tggctttatc 

tacatcacgt 

actttttccc 

ttgtctaaaa 

ctacttgttc 

cccagactaa 

tgacctcttg 

agctctcctg 

aattaataca 

tctttttgtc 

tactacattc 

gaaattacaa 

gtgatcattt 

tctgatgatc 

ttttaaattt 

gttctccctc 

ctggctttgt 

ttatgtaata 

tttctttttg 

caaaatttga 

ttattgccgt 

tctgcttttc 

cacctgtaaa 

aggtttttgg 

aattgtgggc 

gagcaatagc 

atctgacgaa 

ttgttggctg 

aatttttcag 



cctaaaatat 
aagttgaatg 
gaatgcttgt 
ggttatagga 
ccaacaatgt 
agatttgctt 
actgattttt 
ttattcagta 
ccacatcctc 
agaaaatgag 
ctgctcagct 
tagacaatac 
gtggctgcta 
ggctgcttca 
atattctctg 
tgttatcttc 
aaattactga 
agggactcca 
gacctccagc 
aagctcttct 
aggcgatgcc 
ttggctaatt 
cagggaatca 
gccaactcct 
ctccctcagc 
ccttcttgga 
cagggttttt 
gatagaaaga 
gccccaccgt 
tatctccctt 
acttttatac 
ttaaatgttg 
cttttgcaaa 
ttatttcctc 
tttgggtgta 
gggttgtaga 
ttctaattta 
tgattcatct 
tacctaaata 
cctttctttt 
tttaaaataa 
atatttcagg 
gaatatatta 
ctgaaaaaaa 
actatcctat 
tatatttaaa 
ttagagtgtt 
gttttgttgg 
tttatttgcc 
tggtattaaa 
tgctgtagct 
agtgttcctg 
gccctttcat 
ctccattcag 
aagtgactga 
ttcctttcag 
gtctagggac 
gctttgttaa 
attgaatttt 



ttacttgttt 
gataatattc 
aggaatagta 
gtgaccagag 
aaattttttt 
tatgttgtca 
ggcatgtctt 
catgacatat 
tctggctgtc 
actctgggat 
ttagaggatg 
cctgtggtca 
ccacagacca 
aactatgtgg 
ttcaagatgt 
ttcatcgata 
attccaactt 
aacatgaagt 
tgtcactgag 
ctcttgcttg 
agtttttctt 
taagcaaaaa 
ggagaaatgg 
ctgctaagca 
aaaaccagtg 
agaatttagg 
tttgtttgtt 
ggttggagtg 
cctatgataa 
tatatgttat 
ttaacattgt 
caatgtatcc 
cagtgtgggt 
aagataaatc 
gttcattgtc 
cctttgtacc 
tatcttgccc 
tgaatccttc 
tgtcatggga 
acatgattct 
gtaagcataa 
ttaatttttc 
aattcttttt 
gaaactacag 
taaaaagact 
tgaagtttaa 
ttcttagggt 
cttcacatct 
gccttttaga 
atttactact 
cacttattga 
tttacattta 
tctgttctac 
atcttgattc 
atctctcaga 
ggcttttaag 
tcaataaatg 
tcactgagat 
ttgctcaccc 



ggtcctttac 
ctcaaacaac 
ctcagttact 
agttaaacct 
tggatatctt 
ttgtaaagaa 
cctagatggg 
gcgtgccaag 
ctattaattg 
ggagttatct 
tggttctcat 
caaggaatga 
tttggaagta 
cagagtttga 
gatcggggaa 
gcactgactt 
agggtttctt 
gaaaatattg 
actcccttga 
aaaaccctgt 
ttgtactgaa 
gggatatatt 
ccaacagtac 
ctgatggtgc 
ccttcgcctg 
tctcatgtgg 
tgtttttaat 
agtgtcatgt 
ccaaggcaat 
tgttttaaaa 
atcaggaata 
tgtcatatgc 
gtgattatcc 
cctagatatg 
atcaggaagc 
ttagtgttcc 
agtcctgatg 
atggagtgat 
atttggagct 
ctattgatga 
cttttaatgt 
cttcattttt 
attaggaatg 
aatcaacttc 
caaagtgaat 
gaactggctt 
ttgccctcta 
gcctttaaaa 
gaaggtaaca 
gtgggagaag 
tggtttattt 
attgtttaag 
tatagcacag 
tgttcaaggg 
accagcgtcc 
attaatgaca 
gtagcactgt 
ttgattattc 
attggtagta 



aaaaaaaagt 
ccttcctgac 
ctctggccat 
ttgaccacat 
atgcaaaatt 
aacaacacag 
ggatgtatca 
tttgtgctca 
atgccagtag 
ggcctgcttg 
tgttcatggt 
agtgcctttt 
catgaggaat 
aggaaatgat 
ccagggagtt 
aacaaaaatc 
atatgaaagt 
ggaagaattt 
cagcagaagc 
aaggatctct 
cagcaatctt 
atagggtatt 
cagggttgtt 
ttgggagcag 
aagattctgt 
aaccctagct 
aaggacagcc 
gaatcttttg 
tccattttgg 
atgatataat 
tctttccaat 
tagtccctat 
ttatggtcaa 
aatgaaatgc 
ttcctctgtc 
agattttttt 
tttttactgt 
gtgtggggca 
agttcctgac 
taacgtgact 
aaatgtctga 
gccttatttg 
acagtgatgg 
aaggttgctt 
tgtctatttg 
tggttcccaa 
ggatcatggt 
ggctctcagt 
cttatactat 
ttaggactta 
tgtttcttaa 
ctttctgtgt 
tggcataaca 
tctgccactt 
cccatctcta 
gtgtaaaaac 
ttatctaaaa 
ttcgaaataa 
tgtattttgt 



ttgctgaccc 
tgcctgctag 
tagctaatat 
ggaaaaaatg 
ttttgatgtg 
cataacttat 
cttctagatg 
gcactgggaa 
ttccaggtaa 
ggtttgaaaa 
atgcagtggc 
aggcagtgct 
ataagaacca 
aagcttaagc 
tctgtgacac 
agatgatagt 
gaggacattt 
agttgacctt 
ttctcttccc 
tctgagatga 
cgtttacagt 
accttacaga 
caagtaggga 
gtagtaaatc 
ctccttgtga 
acaagggaga 
tttatagtac 
cagtgtctac 
cataaaacat 
gccttatttt 
aaactatttc 
aatatgagtt 
atgcttgaac 
cttgatcgac 
ctttggaatc 
cctcctaata 
tgttgtttgc 
tgcatttatg 
ttcattctgg 
gcttcttatt 
ttttgccact 
atagcaatga 
ggaaggagaa 
tagatttcat 
aattgttttt 
tagtcatgtg 
ttggtctttt 
caacttgctc 
tagcacactg 
cacgtggctt 
ctgct'cttca 
ttaccatcca 
gaggggacat 
agaactttct 
aaatagaaat 
tacctcatat 
aggcatctat 
attatctaac 
tcatttgaat 
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cttaccaaat 
ctgaatagtg 
ttttggtagt 
taatgtgttg 
agtggtagat 
caccacctga 
tgggtctgtg 
tattggattt 
tctttaattt 
tcgctgtgtt 
tcctgggttc 
accaccatgc 
gtagaggtgg 
ttgcccactt 
atacaaccag 
aaatgtaaaa 
gaaatgtatt 
ttgaatatcc 
cttttttgaa 
attatgatat 
gttagctaat 
gtcaaagacc 
aatcccatcc 
aacattccaa 
tttttaaaat 
atttttgttt 
cgaaatgaat 
ttatggtttc 
ttgctagtaa 
agcagtgatt 
aactaactac 
tattctagaa 
atattaaaag 
ctgcattcag 
atagatctat 
tgcaggggta 
cttttccttc 
ttttttcagt 
gtagaacaag 
acgtttggtg 
ctatctaagt 
agaagggtaa 
gactactccc 
aactttccca 
ttggtgaaat 
tagaacgacc 
tgaaagaggc 
ttaaataatt 
cattcttgtc 
tctttataca 
aaaacaacct 
ctcagtaaaa 
aagaatgtcc 
catgtttgct 
atatatgcct 
tgcaatggtt 
gtttcagagg 
gaatattgct 
atgtctgtgc 



agaatgcata 
aaaatactta 
atttttgtct 
acaacagacc 
atagtccaag 
aactactggc 
ccaagtggcc 
ggttttgttt 
tcgaaaagat 
gcccaggctg 
cagggattct 
ccggctaagt 
ggtttggcca 
cagcctccca 
tttttaatag 
caaacaaata 
tgaaatttca 
ttttttttct 
tttttccttg 
taagtgtata 
tttattcagt 
ggataaattt 
ccccttttta 
agtcaccttt 
ttatagcaga 
aactgacatg 
ttactttttt 
atgtaatttt 
ctagtcttag 
tcataaagta 
atgtcatctt 
aattagaaaa 
acatccttct 
aagctaaatc 
ttagcagcca 
tcttttctcg 
tcccttctgt 
ggttgaggag 
tacttttgat 
cttttccaaa 
agtttgggca 
agtaacatgc 
taacacctgc 
attttataat 
agtctttctt 
aaatgtgaaa 
tgtgatactg 
ttatttcatg 
actaatgaat 
agatgtttta 
aaatgtcaac 
tagagtgtac 
acatctattg 
aaatgtgtat 
aggggataag 
atgattttta 
aagtgcataa 
gacctaccat 
taaaccattc 



actgggaagg 
cctgagccaa 
tgcctaccct 
aggtaatgaa 
ggcaaaatct 
acacactgcg 
ttattccagt 
ccctctccct 
acattttttt 
gagtgcagtg 
cctgcctcag 
tttttttttt 
tgttgcccag 
aagagctggg 
ggatttctct 
aaagagaaaa 
ggatgaaata 
actttttaat 
ttttttgaac 
tttgttaaaa 
gaattttttg 
atatcattgg 
attttgcaaa 
taacttagaa 
aatcttgagt 
aaatcctagg 
cttactttat 
aatctagtag 
agtaacagat 
atagattgta 
agaacaggct 
gcttttgata 
tggactctta 
ttccctgtaa 
aatagaaact 
gtttcccatc 
ctgtctgcct 
aggattagat 
actccaaaat 
cactctaata 
gatggtatta 
cccaggttgt 
ttcctgctca 
gcgtttcttt 
aatttaaaga 
tggagtgacg 
cctattaaat 
agtttgtaaa 
tatcttaaat 
tttcttatac 
cagtagggta 
atctacatta 
caaagatcaa 
gctgagaagt 
attataggag 
aagagtattt 
cctctagtag 
ctggggaaac 
ctttatatta 



atttccattt 
acttttagcc 
actgggatgt 
tagagaacat 
tgagtttaca 
cactcagttg 
tgtaaagcaa 
accctgcccc 
tttctttttt 
gcgcaatcac 
cctcctgagt 
gtttgtttgt 
gctgctctcg 
attacaattg 
aaattcccta 
aaaaccaaat 
gaatatgaaa 
ggtttgctgc 
agtattttaa 
ttaaaaatcg 
tgattaagta 
aagtacattc 
ccatgtctag 
gtattatata 
cagacggtat 
attactaatc 
agtagtctga 
ctaaagattt 
tatctttcct 
taagtggtgg 
gcttcatagt 
aaagagaact 
ctgcaataat 
gttaactttg 
tgggcaaatc 
at ct get tec 
atcttgagtt 
caccttttct 
caatgtattc 
ttcattatgt 
ttttcatttc 
ctagtcatcc 
egctatttea 
aacatcatta 
taaegtaatt 
ttgctggact 
ttcctcatct 
agttcattga 
aaattgtgac 
atatccatat 
attcaccaac 
ataaattgaa 
aaagcagtta 
aagtatagtt 
atttttactt 
cttaaaaaga 
caaaaggaca 
ttacaagttc 
tccttcttgt 



tataggttca 
ttgetatact 
ctaggaaaac 
etttagatge 
etcaagegta 
gcatttattg 
gacacttttt 
aggtaaaaat 
gtttttttga 
ggcttactgc 
agctgggaac 
ttgtttgttt 
aactcctgag 
tgagccaccg 
tcacttacaa 
acttaaaaca 
ggagtcctag 
tggatttcat 
atcttttact 
gtttgtgatt 
atatactagt 
catttgttag 
tatgagattg 
tgaaaacaaa 
acatatataa 
ttaatgaaac 
acttttggaa 
tcataattta 
aacaagaaga 
tgttctgtta 
ttatagtgga 
agaaaatttg 
tgctttggct 
ttaattaaca 
ttaggaaagt 
attccttcct 
aaatttgtta 
tttggttaac 
aaatataacc 
catttgattt 
acaagtgagg 
acctggttcc 
aaaaggttgt 
tttcatgttg 
ttttgcaggt 
tgaaggagee 
ttttacaggt 
tgactgaggt 
tgttaaaagg 
ttgcagtgta 
atgcagtgga 
atetgeattg 
taaactgaat 
agattcacat 
ttgactttca 
ggagtaagaa 
tttcattatc 
tagttccagc 
gactgatagc 



gccttggtta 
agtgtcttcg 
actgtaagga 
caagaacttc 
gtggtgtccc 
ggtaacccag 
ttgtctgctc 
cagttgeata 
gacaagggtg 
agcctcgaac 
acaggcacac 
gtttgttttt 
ctcaagtgat 
cacttggccg 
taaaggccag 
tttaaatgag 
ccttttatat 
actattgtat 
tctgttttag 
tgagttttct 
aaatagtata 
catttttcat 
tttatcagta 
tcatgacaca 
aataaagttt 
ataaaaaaga 
tacaaaatgc 
gggaagtaag 
aacaatcagt 
cacttaaaac 
acagtttcca 
tcagtaggct 
tttgaatata 
aaagttgtaa 
attgeaaate 
ttcctctctc 
agcatatttt 
aggagaaatg 
accttatgga 
tcgcaccaac 
aagctgaaac 
tagtcctatt 
tagtacataa 
tcttctaact 
gccattgtta 
aaagaagcac 
gacataactt 
gaagcattgt 
tggtgatata 
tatatattgc 
atactatgea 
tattttattg 
agtatgattc 
agaaatatta 
tctgaatttt 
tcacttaaca 
actgtgaaga 
agtaagtgat 
cttgactgta 
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atatggccac 
atgtcaacag 
atagagcagt 
gtgcagttta 
aatcctataa 
tttttctcaa 
cattactaat 
atagcataaa 
caaagcaagc 
ggcaggagcc 
ccagagatta 
cgttattgaa 
acattaaaaa 
atatgtacca 
ttcaagtaac 
ttcttttgca 
ttttgttttg 
atggcgtggt 
cagcctcccg 
ttttactaga 
gtgatccgcc 
agccatagtt 
attttaggca 
cactggttat 
catgaaagtc 
cgagaccagc 
ggcgtaatgg 
tgaaaccagg 
caacaagagt 
cactaacctg 
ctttgttagg 
ccatttttta 
atgatcagaa 
aaatcctttt 
aacaaatgta 
ctctagaaga 
gttttgatcc 
tacaattaag 
caccttggag 
aagctgtagc 
tttctaagtg 
ctttgcctgt 
agagaagtca 
tcttttgtgg 
agttgtagca 
tcttgtaata 
gttttcattt 
ctccattatc 
tgaagccgca 
tttttttttt 
cgatctcagc 
cctgagtagc 
ttttaaagtc 
cagaatttct 
catatgcata 
agaaaagaga 
atgtagactt 
ccacaatctg 
tcatattgcc 



acccacagtc 
caaatctgca 
gatgaagagc 
ctacccaact 
aagcacaaaa 
acatgagtca 
ttttaatatt 
tatacagaga 
atgccagtgt 
ttctgcctca 
ttttgcctgt 
aagactgctt 
attatttaaa 
tactcagaca 
agttgctttc 
gatactgtgt 
tttctttttt 
ctcgtttcac 
agtatttggg 
gtcagggttt 
catctcagcc 
ttttaaatgt 
ggattataat 
attttggcaa 
tgctacctag 
ctgaccaaca 
cgggcaccta 
aggcagaggt 
gagactctgt 
tgctttcctg 
ttcacttttc 
tggatataat 
agtacctttt 
gcctgtgcac 
cttcctgtta 
aacaaaatat 
ttagaggttt 
tgttaaaatt 
gggaatccta 
aacagaagcc 
gctaggtgaa 
aattactgta 
tgtaatacag 
gaaaagatag 
ttcttatttt 
ttgatatatt 
atttttagac 
ttcattgatg 
cgtagaatta 
ctttttgaga 
tcactacaac 
tgggattaca 
gtttttgttt 
ccattctaac 
cataattttt 
tttaatgtat 
cgaatgaaat 
ggagatttga 
gtaaggtgat 



tgacagtcag 
atgttggcct 
aaaaagcctg 
tcaggagtat 
atatggttga 
ttgatgtatc 
tttctttgct 
catacaggag 
catcaaccag 
ttcctcccga 
atttgaactt 
tgtgccttta 
gtatatacca 
ctgacttgta 
ataacagtta 
acacatccat 
tgagacagag 
tgcaacctcc 
attacaggca 
caccatgttg 
tctcaaagtg 
atgtttgata 
tgctcattag 
aactaggatt 
ctggaggctg 
tggtaaaacc 
taatctcagc 
tgcagtgagc 
ctcaaaaaaa 
tggaatcaag 
caaagagcat 
ttttgtacaa 
aactgaactc 
atgttacggt 
gaaggcttat 
atttaagaca 
agagtgaaaa 
ctcatgacat 
ttatttgggc 
aacaactcaa 
agtgaaaagt 
tcactcctta 
gtactcactt 
catatatcta 
ctgtacctga 
actttgtttt 
tggttaagaa 
aaattgattc 
agacggagtt 
cggagtctca 
ctccacctcc 
ggcgcgcacc 
attttataac 
aaactgtttc 
actcaattgt 
tcatttataa 
ttttcaagaa 
ttcatatatt 
atttgattaa 



aggtttttcc 
cagactgaag 
taggccacgc 
gaaacttaac 
actcatttta 
atcttcatga 
tattttttag 
cccctgggtg 
atccagaaat 
gaggtcatct 
taaatcatct 
taagtggaaa 
cctaaaagta 
aatgtataca 
agtgaaacag 
agtttgcacc 
tttcactctt 
acctcccgag 
tctgccacca 
gccacagttg 
ctgggattac 
gtttactgcc 
caaaattgag 
taaagtagta 
aggtgggcag 
ccgtctctac 
tacctgggag 
cgagatcgca 
ataaaataaa 
aattcagcag 
aaatattcaa 
tattctggta 
ttttccccta 
tttgatacat 
gttatgcaag 
taccttagat 
tctttccaga 
ttttctaaat 
cgcctggaac 
catttttttc 
aagtagtaaa 
atagatttct 
ctctgtcaaa 
caacctagtg 
aacatgatta 
aatcttttac 
tttattccaa 
tctctgtggt 
cctagtgcaa 
ctctgtcgcc 
caggttcaag 
accatgcccg 
aatttttata 
attttgccat 
aattatcttg 
tgtaccagga 
atattttgaa 
tgcatcatat 
atacagttct 



ttttgcaaac 
agtccagaca 
actgtccgcc 
agcataaaag 
tggtatatta 
atttttccat 
aacaacattt 
ttgggcttcc 
aacagcattg 
ctctctctcg 
cacatatttt 
aaaaagtgtt 
cttcatgtta 
tgtaaaatag 
tgttattcat 
tagcagtttt 
gtcacccagg 
ttcaagcaat 
cacccagcta 
tcccgagctc 
atacatgagc 
agctgtaacc 
gctcgaagag 
tcaccttttt 
atcacctgag 
taaaatacaa 
gctgaggcgg 
ccactgccct 
agaaagtctg 
aaactcacaa 
agaaaatctc 
tcaagcagtg 
ctagactcca 
attggttgaa 
tctctttttg 
tgatttattt 
gtattcctca 
ctaaatttta 
aggaaagtcc 
aatatcttcc 
tcaatttttt 
tacatcactt 
tcagtctcta 
aatcataaag 
ggctttttaa 
tattttttag 
cttgccagag 
tcaagaagtg 
atgcaaggta 
caggctggag 
caattctcat 
gctaatttaa 
acagtaatga 
gtttcattcc 
taggactcac 
acaacttcag 
acgaatgtta 
tgcatcatag 
tttccaaaga 



atgttggttt 
gtcgtatcgt 
tagcttcaag 
ttgacatcat 
cttataccac 
gtatgtgtac 
ttactagagt 
ttaattgtca 
ccagggacct 
acttctaacg 
tacttagtaa 
tttccaatac 
taccagtggt 
gaaaatgaaa 
ctgcatatca 
ttttgttttg 
ctggaatgca 
tctcctgcct 
atttttgtat 
ctgacctcag 
caccatgccc 
cacctgtttt 
ttgatgtatt 
gttacccaac 
gtcagaagat 
aaactagcca 
gagaattgct 
ccagcctggg 
ctacctataa 
gtcatccaaa 
ttccagctgc 
tacacagtag 
cgcagggcag 
tgaatgagca 
ataaagtagt 
ttaaaatatt 
gtatagatga 
ggcaagagaa 
tacttagcca 
tctgatcttg 
aaagtttttt 
atagatattc 
atctagaatt 
tgaattatcc 
tctacagagt 
agacttaaaa 
agaacaagcc 
aaaatgaaag 
gtgttactaa 
tgcagtggcg 
gcctcagcct 
tgttactgat 
aaaattcaag 
aattgtcttc 
tgacataaat 
taaaagtaat 
atagtatttt 
tgtatttgga 
cagactctca 
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agatgaagca 
aaagaacacc 
tagacaatga 
ccattaggcg 
gtagatgata 
tgtaatgtgc 
ttgtttgttt 
tagaaaatat 
cttcaaaaat 
gcaaaatggc 
tgagtcggag 
catcctccac 
tacaggtatg 
tcatgtggcc 
gagtgctgga 
aaaaactata 
accaaggtag 
aaccctgtct 
tagtcccagc 
tgcagtgagc 
tcaaaaaaaa 
atatgtatat 
cttcagttct 
cattcattta 
cacttgatga 
agggtatttt 
aattcttttc 
aattatccct 
aattattgtt 
tcattaaagt 
tgaatgtaga 
cttgatcaag 
tatttcatga 
gattcagcca 
tgttctggat 
cacttaactt 
tgatgtaatt 
tgcccgagca 
agactttcgg 
tgtacgtgat 
ggtaagtcat 
tctgttgccc 
gggttcaagc 
ccacgcctgg 
ctggtctcga 
cactgcgcct 
gtcttcaaag 
taatagacat 
tttttggctg 
gcagatctct 
taaaaaatac 
aggctgaagt 
caccagtgta 
ttattaatca 
tattattttg 
tctagaatat 
agctaatatc 
ttcatttggt 
gttagtcttg 



ctttaatggt 
aaaatgacaa 
tggaattttg 
aaggtaggat 
attctctttt 
tctaaccctg 
ccctagttta 
attacctaat 
ttataaacat 
aattctataa 
tctcactctg 
ctcccgggtt 
caccaccacg 
aggatggtct 
attacagatg 
tgtcaggccg 
gcagattgcc 
ctactaaaat 
tactcgggaa 
caagattgtg 
atatatatat 
atgtatacac 
aggaaaggta 
ttttgtatgg 
tttgcgccag 
tgaaaaattg 
attgtcacag 
catcgttgct 
aatgaaacat 
ggacattttg 
gaattactgt 
agttcagagt 
attccagaca 
cttcaaactg 
attatccctt 
agtatctgtt 
tctcttttaa 
gcaatgttta 
gaacttggga 
gcccttatgc 
ttttatctat 
aggctggagt 
aattctcatg 
ctaatttttt 
actcctgact 
ggcccttttc 
aatttgggag 
atttctgtgt 
ggtgcagtgc 
tgagctcagg 
aaaaattagc 
gggagaatct 
ctccagcctg 
tcctaatgtg 
atataagtat 
ttttgagtag 
ttttaaatat 
cttcacaaaa 
gacgtaggtg 



aaagcaccac 
attaattaaa 
gttctgggag 
aatatagtaa 
tgtcttgtca 
ttgctcccct 
tcaaatatac 
gcttaccttt 
agtttgacaa 
gtcaaaataa 
ttgccaggct 
caagtgattc 
cccagctaat 
cgatctcttg 
tgagccaccg 
ggtgtggtgg 
tgagctcagg 
ccaaaaaaat 
gctgaggtgg 
ccactgcact 
atatatatat 
acacacatct 
tacggactct 
taaaaattag 
cattctagct 
ccttattctg 
agagttacct 
tctctgtctc 
agtgtttatt 
tgtaatatat 
gtcatctcta 
acaattttgt 
taggaacact 
gctcccaggc 
taacctaaat 
tcttcttgga 
gatttgagaa 
aactgcacct 
ggaaaacaga 
agcctgttag 
cctttcctat 
gcagtggcac 
ccttagcctc 
gtatttttag 
ccaggtgatc 
ctgtattttt 
gatttcagtg 
cactgaaaca 
cccatacctg 
agagaccagc 
tggatgtggt 
cttgagccca 
ggtgacacag 
ttttaaattt 
agaaacttaa 
aatctggtgc 
tcacatgcct 
atcagcaaga 
ctaacacagt 



atttcttaac 
gcattaccta 
ctacaaatat 
atatcttatt 
ggaagcattc 
gccttgagtg 
aagatttcca 
cattctgatt 
ttcagtaaaa 
gataaccagg 
ggagtgcggt 
tcctgcctca 
ttgtatattt 
acctcatgat 
tgcccagcca 
ctcatgcctg 
agttcaaaac 
tacctggcca 
gagaatcgct 
ccagcttggg 
gtgtgtgtgt 
atctatctaa 
tcacttttca 
cacttcagtt 
taccatccag 
cacccatctc 
ggagaaaaaa 
caaataccga 
ctttataata 
atcctataag 
ctcctgtttt 
gggaaactgt 
gtggccaaaa 
tattttgatt 
atatatgttt 
aatatcctaa 
acgaatttat 
agggaccact 
tggttattca 
gaaagtacag 
cttttttttt 
tatctgggct 
ctgagtagct 
tagagacagg 
cacccacctc 
aaaatcaagt 
ttgtataata 
agttctttaa 
taatcccagc 
ctcaggcaac 
ggcacactcc 
ggaggtggag 
tgagaccctg 
tcctatgaaa 
atgtacagat 
atttatacct 
ggcactgtgt 
gaggtacatg 
agctagagtg 



agaatatatg 
ttttaatgca 
accctgggtt 
tctcactgca 
taacaacttt 
ggagaagggt 
cacacttctg 
aagtttggtg 
accatttata 
taagattttt 
ggtgcgatct 
gcctcccaag 
ttagtagaga 
ccacccgcct 
agattttttt 
taatcccagc 
caccctgggc 
tggtggcgca 
tgagcccagg 
caacagagtg 
gtgtatatat 
tttttaagtt 
ctcttcacag 
tttgccactt 
taccaagaaa 
tagaggcaac 
cagtctgctc 
aattctaaga 
taggattgtt 
attgtgaggt 
atattgttaa 
aaccctgtag 
taacccaaaa 
ttatttgttt 
aaccttggca 
ttttcaatta 
attcccttgc 
cagaacagtc 
ggggcagata 
tcagctactc 
tttttttgag 
tgctgcaacc 
ggaattacag 
atttcaccat 
ggcctcccaa 
ttcaagaccg 
ataccataag 
taaagtttat 
actttaggag 
atggcaaaac 
tgtagtccca 
gttgcagtga 
tcttaaaaaa 
tgatttttga 
gtaaggctgc 
cataaagtaa 
tctgtctttt 
tagccagcca 
ggacagacat 



gaagaagttt 
ggggttggtg 
ctggattctg 
gatgtaacaa 
tcaaaagatt 
ggtcatttgc 
tctttcatat 
ggcactgatg 
taaaatttta 
tttttttttt 
cggctcactg 
tagctgggac 
cgagatttca 
cagcctccga 
tttttttagt 
actttgggaa 
aacgtggtga 
cacctgtctg 
aggcggaggt 
agactccatc 
gtatatgtgt 
gaaaatgaaa 
ttctgtatta 
ttatcccatt 
ggaaatgtgc 
ggaaacttta 
agaccactta 
aagcaagcaa 
ctggatgtat 
aattctgttt 
tcattagtcc 
gacctctttt 
atgcacgaaa 
gtttttaatg 
catacaggcc 
aatatattct 
cggaacccca 
tcacggaagc 
taagtatcat 
attttaaaaa 
acagtctcac 
tccgcctccc 
gcgtgtgcca 
gttggccagg 
aggcgtgagc 
tcattggttt 
tgggtttaca 
ttattggcca 
gccgaggtgg 
cccatctctg 
gctacttggg 
gccgagattg 
agaaaaggcc 
agtatttgtg 
ttctgacaca 
aaataataat 
atcttactat 
cctaatggtg 
gtgatttgcc 
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tggaggtgag 
gtgttctgtt 
gaaggctgtg 
cccttttgct 
tagcaaacct 
ttatcatgta 
gaagcctggc 
ctcattttta 
aatgaagtga 
ttactcatcc 
gccattatgt 
cagtcctttc 
acatgacgct 
tatctctgca 
tcttttaaag 
tactagtatt 
agtcaaaaaa 
taaaatacac 
aatatttaaa 
ggtggctcac 

aggggttcga 

attagctcgg 
gaattgcttg 
agcctgggct 
tgatgcagac 
tggatattgt 
tagcatacaa 
catactaaag 
tttaatatat 
aaagagaaag 
agcaatttat 
ctgggtgcag 
acaaggtcag 
ataaaaaaaa 
tgagggatga 
acttcactcc 
caaaatctat 
tccaaaagta 
aatggattaa 
ttcaggttcg 
cttgctctcc 
aacttttgga 
taaaacatta 
acaggtgtat 
aaatataaaa 
aaaagctatt 
caaaatgctc 
aatatttttt 
cagcactttg 
ggccaacatg 
gcacacctgc 
aggttgcagt 
tgtctccaaa 
acactttggg 
aacacggtga 
tgcgtgtagt 
ggaggttgca 
tctgtctcaa 
tgtttttata 



gtgtgtttcc 
ttgtctgcaa 
gtcttgagat 
gtgaacaaca 
aatctcataa 
ccataataag 
accatttact 
tcatttctaa 
tataatactc 
acccttcttc 
tttttcttaa 
ctactgagag 
gaagccaggt 
gctggggatc 
ctgattattt 
ttatttctaa 
gacaaaaatc 
ataataaggt 
gtcgattatt 
gcctgtaatc 
gaccagcctg 
cgtcagggcg 
aaccctggag 
acaaagcgag 
tattgccatc 
taaaacttat 
ttttgcactt 
ttccaaatat 
ctgtttttgg 
ccacaggtaa 
tgtaggaaat 
tcgctcacac 
gagatcaaga 
attagctggg 
gaatcccttg 
agcctggtga 
cccattaccc 
aacattctaa 
gtcttaatgt 
cggaccttcc 
aggtgaccct 
gccagttgtt 
tacttcataa 
tttacttaat 
agtcattatt 
ttttaaagca 
atatatagta 
ttcccctgca 
ggaggccaag 
gtgaaatgcc 
aatccagcta 
gagccgagat 
aaaaaaacaa 
aggccgaggc 
aaccccatct 
cccagctact 
gtgagccgag 
aaaacaacaa 
gtgttttcta 



aaatctgaaa 
gaacaagtta 
acccagtttg 
gtatccagta 
tgtatgtcaa 
atttttttaa 
tgttatacct 
aatggggatg 
ataaaacacc 
agccatttca 
attctctttg 
tttgtcactc 
ccttgttagt 
accagctgca 
cagttaacaa 
aatactaatc 
cctgacctca 
gagagcatgt 
gtgaaacaga 
ccagcacttt 
gccatcatga 
ggcgcctata 
gtggaagttg 
actccgtctc 
taaatgttta 
aaataagttg 
agatttgaga 
tacagaagtt 
tctttaagat 
ctattttttt 
ttcaaaatat 
ctgtaatccc 
ccatcttggc 
tgtggtggca 
aacccaggag 
cagagcaaga 
aggggataac 
taaaacatac 
tcaattttgt 
cgagctgatc 
ggtgccattg 
tccatggttt 
ttttcagtta 
attgttgact 
tttaaactat 
caagattctg 
taattattgt 
tctcagattt 
gcgggcagac 
atctctccta 
ctcagctgag 
tgcacaaccg 
aaaatagggc 
gggtggatca 
ctactaaaaa 
tgggaggctg 
atcataccac 
caacaaaaac 
tttcacttct 



tctcagccat 
agcatctgtg 
agtaacaatg 
ttactcatga 
gactaatatg 
gttaacaagc 
ttttgagcaa 
tgataatgct 
agtcgtggta 
ttactgaagt 
ccatttagaa 
aattctgcat 
gggatgaagg 
gcagacctct 
gaagaagatt 
tgggaccggt 
tgcagctaaa 
atttcttcac 
tgtgaaagtg 
gggaggccaa 
taaaacccca 
atcccagcta 
cagtgagctg 
aaaaaaaaaa 
catgtgacta 
tacatgataa 
gcatttttgt 
tctcagctat 
tactaccact 
tttaactcat 
agaaagcatc 
agcactttgg 
caacatggtg 
catgcctgta 
gtggaggttg 
cgccctttca 
tactttttgt 
aaaactatta 
ttcaaataca 
ctaaccatct 
aaatgacatg 
gaaattttgt 
tataagattt 
cattaaaaat 
tttcttgggt 
actttacagg 
aaagacctaa 
tttaaaataa 
cacttgaggc 
aaaatacaaa 
gcaggagaat 
caccccagcc 
caggtacggt 
tgaggtcagg 
tacaaaaagt 
aggcaggaga 
tgcactccag 
aaaaaccaga 
caaagctcca 



tctgctatcc 
gctgagataa 
aactaggtaa 
acatacatgt 
actaagtcac 
ttgaattaaa 
attctattac 
attttgaagt 
cttcagatgc 
actgttaagt 
actataaatg 
attactgcca 
tgacacagcc 
cagcggctcc 
ccctgtttat 
ggtagatcag 
atattagtga 
atctacacta 
cccccaagtt 
ggtgggcgga 
tctctactaa 
ctcgggaggc 
acatcgcacc 
aaaaagaagt 
tgtcacaaag 
aatggcattg 
tgaaagagat 
agagcaactt 
tattaagcac 
cttctatgtg 
aagaaaaaaa 
gaggccgagg 
aaaccccgtc 
atcctagcta 
cagtgagctg 
aaaaaaaaac 
tgtatattct 
tacattttaa 
tatttcagat 
tgtagatgat 
gatggatgtc 
ttgtttttaa 
gctggaaaat 
tatgaagtta 
ttagaggaac 
tcttatagct 
ctaaaaaata 
ccagaagtca 
caggagtttg 
aattagccag 
cacgtgaacc 
tgggcaacag 
ggctcatgcc 
agatcgagac 
tagccagacg 
atagcttgaa 
cctgggcaac 
aatactaatg 
tagaaagttt 
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gttctatttt 
atctacaaga 
ttgtttactt 
aaagaccttt 
tagatataca 
tatctgagaa 
tttgtacttt 
tttcatatcc 
ttagcatatt 
tagtagattt 
.cttttgaatt 
aattagatca 
gataagtatt 
aggaataaca 
gcccacgcct 
gggttcgaga 
tagccaggtg 
atctcttgaa 
cctgagccaa 
gttagtatct 
aataatgata 
cttctttttc 
aatgaacatg 
gccaaagaca 
tttggatcgc 
cttcagagtt 
attaacaaaa 
aaacagaaat 
tatgaatggt 
tagggtttat 
tacgttatta 
aaagagcttt 
ttaataagtt 
tggctgggca 
atcacctgag 
taaaaataca 
ggctgaggcg 
gctactgcat 
aataaaactc 
acctttcact 
aagaagaaaa 
gtgtgtaatt 
gtctcccttt 
tgtatatttc 
ttcagggaac 
tagaaatgag 
taaataatct 
ttgcattgtc 
agtgaaattc 
ttttacatat 
tggaaaaggc 
tctgtgtatt 
ttagatacaa 
acttttatgt 
gataatttat 
ctttatggac 
ccagtaatta 
agcatctcta 
tataaggact 



tatttttaaa 
cttactttgc 
acagaagaac 
aatggcaggt 
tgtaaatgtg 
gttactcaga 
gttcagcttt 
ttttacagga 
tttctaacca 
ttcatagaat 
atactgactt 
tgaacatact 
ggggagccat 
tgatcaaaga 
gtaatcccag 
ccagcctgac 
tggtttcagg 
cctgggaggc 
cagagtgaga 
tatggcagac 
atatgacggt 
cctgacagtc 
acttgttgaa 
aggaagatgc 
attaattgtt 
ccattaggtt 
ggtacaaaat 
tacccagtaa 
ggtctttgca 
tttatatttt 
aaaaattaca 
gtgattgggt 
gatatttttt 
tggtggctta 
gtcaggagtt 
aaaattagct 
ggagaatcac 
tccagcctgg 
tgtctcaaaa 
gtgcaatctc 
ttgtctattt 
tttaactgtc 
gaattatggg 
cttttttaat 
agtttgtcct 
ctcatgactc 
gaggatgtgt 
tatacatgca 
tatcttgcaa 
gctatgtggt 
ctgaattatg 
attctttatt 
agaaatcatt 
gaggattgag 
acatcttacc 
agatactcaa 
taaagtttag 
tgacttacac 
ctataattat 



aattacagtt 
aaggcatgtt 
attctcagtt 
ttaatagtaa 
tatacaccca 
aacccctttg 
gaagaatttt 
gtctaggctt 
tttcgatctt 
ggtattctgc 
ggaaaattga 
ttggaggcct 
ttgatcagtt 
tcatgaaaga 
cacttcgcaa 
caacaaaacg 
tgcctgtaat 
agaggttgca 
ctccaactca 
tgcaaatatt 
tttttatttt 
ggatatgttg 
attaaagaag 
ttaccatatg 
tccagtaaaa 
ttatattgta 
aacaggttat 
aaaggatgtc 
aagagcattt 
caaaaaaatt 
aatgatagaa 
ttgaaagatt 
tctgtgtaga 
tgcctgtaat 
caagaccagt 
gggcatgatg 
ttgaacctgg 
gcgccgagat 
aaaaaca'aaa 
aagaggaaga 
cagtgtcttt 
atgaaatact 
aagtttcttg 
acaggataat 
gaccacacat 
attaaaaaat 
ttcatacttt 
aaagactatt 
atgctataga 
tgcctttggg 
ctcttatgtg 
ttgggacctt 
aaaattatta 
aactaactta 
tgtttatcag 
aggaatccga 
gtagacattt 
cctggatatt 
aaaacatgtc 



aatatagttt 
atttttaggg 
ttcaaaattt 
cttccagagg 
tatatttgtg 
ttttaatcct 
tagtaactta 
agaagttagt 
aattgtaact 
ttcacttaag 
tctcgtttaa 
tgatcctcag 
caggggaaga 
tctgtgaaag 
ggcagaggca 
aaaccccatc 
cccagctact 
atgagccaag 
aaaaaaaaaa 
catatttaaa 
gttaactcta 
cggtcactat 
tttacagaag 
tattctttct 
ctcttttacc 
cttttcctcc 
gaggaaatga 
agaaattgac 
atattttctt 
gttaaacatc 
tctttactca 
ttgaattata 
tttaaataat 
cccagcactt 
ctggccaaca 
gcgagtgtct 
gaggcagagg 
agcgccattg 
acggtaagct 
ttttctaaat 
tttagcaaga 
gtaatgcgca 
tctgtgatat 
aatatggggt 
atgttatcag 
aataaacgat 
agcatcaacg 
aagttaggat 
ataacttgca 
ttttctgtac 
aagtaaattg 
ctgtgtaaaa 
atgtatatat 
gtaaactaac 
cctcagaatt 
atgggcatgt 
tctgagccag 
tgaaagcatt 
aatagagaat 



atattggaag 
aacttagggt 
tctgtcagtt 
agaaactttt 
tgtgaactaa 
tggtgtatta 
gagtggtcct 
tttttaatga 
aattatttta 
tgtaagtgaa 
gaaaaagaga 
actaagggga 
aagtatacgt 
aataggggcc 
ggcggatcac 
tctactaaaa 
tgggaggctg 
atcacgccac 
aaaagaatag 
cttaggttaa 
ttattaatat 
ctaacacaaa 
attttggtca 
ttcatagata 
acagggaaat 
attacttatt 
gcgatatatg 
atacaaatat 
ttttttttac 
attcttatca 
gggatgggca 
gacagtgctc 
ttctttaaaa 
tgggaggcta 
tggtgaaatg 
gtaatcccag 
ttgcagtgag 
cactccagcc 
tttatattta 
tagacattgt 
ttgaccaaaa 
cctttgctaa 
ggggtgtctg 
cttgtccttc 
tgtaatgctt 
gctattttaa 
ctgagttgtg 
gaactagatt 
ttttaaagtg 
agattgtttt 
atataaccta 
gaaaatatta 
gaaaattctg 
actgaattct 
atacagtact 
tctatttctc 
caagtagatc 
ttgtctcacc 
tgacttctga 



aattaaaata 
gattgaaaag 
tcagatcttc 
acacggtaca 
ccataatgct 
ttttttattt 
tttcattatt 
atctctgata 
aaattctggt 
tgatattttc 
taaggccaga 
ttgtccttct 
ttctacacaa 
gggtgcaggg 
ctgaggtcag 
atacaaaaat 
aggcaggaga 
tgcgctccag 
ggagtttatc 
agttcatagc 
cctccctacc 
acctacagtc 
agaaggctaa 
tttttgtcta 
acacatctca 
aaatactcct 
aacggcataa 
ttacaatttt 
taaaatgata 
atgtaaaatt 
ataaaacagc 
ttaatagttt 
gcgtaagctt 
cgatgggtgg 
ccgtctctac 
ctactcagga 
ctgagatcgc 
tgggcgactg 
attaactgat 
ctttacctta 
caggtggtag 
taaggaaatg 
tgtacatata 
accttttaag 
tatgaagctt 
tattttcaaa 
ccttgtagaa 
ccctaattac 
tatttgcaca 
tgttattaaa 
ttgactgtat 
cctaaattgg 
tttttcaatg 
ttgtggataa 
gctgtgtgtt 
tggctcttga 
' atagttgtta 
ctgagaaatc 
gtccactgtc 



32640 

32700 

32760 

32820 

32880 

32940 

33000 

33060 

33120 

33180 

33240 

33300 

33360 

33420 

33480 

33540 

33600 

33660 

33720 

33780 

33840 

33900 

33960 

34020 

34080 

34140 

34200 

34260 

34320 

34380 

34440 

34500 

34560 

34620 

34680 

34740 

34800 

34860 

34920 

34980 

35040 

35100 

35160 

35220 

35280 

35340 

35400 

35460 

35520 

35580 

35640 

35700 

35760 

35820 

35880 

35940 

36000 

36060 

36120 
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aactacatgc 

ccactttcag 

gcaacacagt 

cccttttctt 

ctttcttttt 

atgtaacctc 

gcagttaaat 

atgtttagat 

gtgttgagtg 

ttattttgat 

tggctttgtt 

ctacacaatc 

tctttcccag 

aatatttata 

tgtaacttaa 

gttctttata 

aaactaggaa 

aacatattta 

aggtgcccac 

agaaatgaaa 

aaaattgata 

aattatctta 

actattgggg 

ctattaccac 

aaaaaaaaaa 



agaaaaacag 

agttgataaa 

gcccattgtt 

tcctactttc 

tttgctagtc 

ttgcaggaag 

tgctgctgct 

catttagatt 

cttttaggta 

ctacctcatg 

cttactggtc 

aaaaatgtat 

ctgcgagttc 

ccatagaaac 

aataggaaga 

gatgcataaa 

tagaagggaa 

attgtgaatt 

tatcaccact 

ggcttacaga 

taaacgtaga 

gcaacaatta 

aattgaagtg 

accactgcac 

aaaggaaaat 



aatgagtgta 

gaggcatacc 

gacactaatg 

atcccttcat 

cacatcacgt 

accacccaaa 

gctgtgttaa 

gtaagttctg 

cttagtaaag 

gattatgtga 

tagtgaagtt 

tcaaactagc 

agtgacctca 

cagcaagagt 

ttaatactat 

aagcaataca 

cttcttaata 

tttgaaaagc 

ttttctcaac 

taagaaataa 

aataatgtga 

gaaattgaaa 

ggaggatcac 

tccagcctgg 

actaattaca 



cttgattagt 

tacctgatgg 

ttattttaag 

ttgtatcctc 

atttccatta 

ctacccccac 

gatttttctc 

agagttacat 

acttggggaa 

gagacttata 

atgataaatt 

tcaccctggc 

tactagtaat 

tcagttagcc 

ttactgcatt 

att cagcact 

gaagaagttc 

gtattcctcc 

tttgtacaga 

aattgtcatt 

atttaacaag 

ttatgagagt 

ttgagcccag 

gtgagggaac 

atagtatcag 



gccacagaag 

tgctgtatgt 

tttgtgagtt 

ctcttatttt 

aaacctaatg 

cctacttttg 

tggagcatgt 

tttccccagc 

aaaaaaaggg 

aaataggtaa 

tctgtctcgg 

ccaccaaaac 

ttaaagtctg 

gtaaaaaatc 

tatggaaata 

tgcttttttt 

tacaaaaaaa 

agagatcagg 

ttctagccag 

cacagatgac 

gttgcaggat 

ggtgtgtgcc 

gagttcgagg 

gagaccctgt 

agagcatcaa 



caccatttag 

tgcaagtgaa 

ttcacaggtg 

aatatggtat 

atgatgaacc 

ggtttcttta 

tctattgatc 

aattaatata 

agttatattt 

gtttataaag 

tgattgagaa 

cgatagtaca 

ctctgatggg 

aaggactgcc 

aaggagacaa 

ttaactgcac 

acctagagcc 

aaaaaggcaa 

tgcaagcaaa 

atgattgtag 

acaaagtcaa 

tatagtccca 

ctgcagtgag 

ctcaaaaaaa 

atgcctcaaa 



36180 

36240 

36300 

36360 

36420 

36480 

36540 

36600 

36660 

36720 

36780 

36840 

36900 

36960 

37020 

37080 

37140 

37200 

37260 

37320 

37380 

37440 

37500 

37560 

37620 



<210> 9 
<211> 444 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : /note 
synthetic construct 

<400> 9 

Met Ser Ser Thr Ser Pro Asn Leu Gin Lys Ala 

1 5 10 

Lys Ala Ala Gin Glu Asp Lys Ala Gly Asn Tyr 

20 25 
Leu Tyr Gin His Ala Val Gin Tyr Phe Leu His 

35 40 
Ala Gin Gly Asp Lys Ala Lys Gin Ser He Arg 

50 55 
Tyr Leu Asp Arg Ala Glu Lys Leu Lys Glu Tyr 
65 70 75 

Lys Lys Ala Gin Lys Pro Val Lys Glu Gly Gin 

85 90 
Glu Lys Gly Asn Asp Ser Asp Gly Glu Gly Glu 

100 105 
Lys Lys Lys Leu Gin Asn Gin Leu Gin Gly Ala 

115 120 
Pro Asn Val Lys Trp Ser Asp Val Ala Gly Leu 

130 " 135 

Ala Leu Lys Glu Ala Val He Leu Pro He Lys 
145 150 155 



He 


Asp 


Leu 


Ala 
15 


Ser 


Glu 


Glu 


Ala 
30 


Leu 


Gin 


Val 


Val 
45 


Lys 


Tyr 


Glu 


Ala 


Lys 


Cys 


Thr 


Glu 


60 










Leu 


Lys 


Asn 


Lys 


Glu 
80 


Pro 


Ser 


Pro 


Ala 
95 


Asp 


Ser 


Asp 


Asp 
110 


Pro 


Glu 


He 


Val 
125 


He 


Glu 


Arg 


Glu 


Gly 


Ala 


Lys 


Glu 


140 










Phe 


Pro 


His 


Leu 


Phe 
160 
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Thr 


Glv 


Lys 


Arg 


Thr 


Pro 




Arg 


Gly 


lie 


Leu 


T .<2»11 


t lie 


<X \ \t 
biy 


tr jl \j 


fro 










165 










170 








175 




Gly 


Thr 


Glv 


Lys 
180 


Ser 


Tvr 


Leu 


Ala 


Lys 
185 


Ala 


v etx 


Al ^ 

nl d 


1 1 1 L. 


blU 

190 


.riX cX 


A ex y-\ 


Asn 


Ser 


Thr 


Phe 


Phe 


Ser 


lie 

X X cs 


Op y- 


c? c?x 


OCX 


nop 


Leu 


vax 


Ser 


Lys 


Trp 






195 










200 










9 0 R 
\j -j 






Leu 


Gly 


Glu 


Ser 


Glu 


xiy o 




V dx 


Lys 


Asn 


L6U 


r ne 


pi n 

bin 


Leu 


Ala 


Arg 




210 










X X »J 










9 9 n 








Glu 


Asn 


jj y o 


Pro 


Ser 


Tip 


Tic, 
X JUG 


t ne 


xxe 


Asp 


bill 


xxe 


Asp 


Ser 


Leu 


bys 


225 










230 










235 










240 


Glv 

oxy 


UCl 


7\ t" rr 
/~vx y 


Ser 


Glu 
245 


7\ en 




Ser 


blU 


ax a 

OCA 


ax a 


Arg 


Arg 


He 


Lys 
255 


Thr 


Glu 




Leu 


Val 


Gin 


rJcL 


uxn 


^xy 


v ax 


bxy 


vax 


Asp 


Asn 


Asp Gly 


He 








260 










265 










270 








vol 


Leu 


Gly Ala 


i nr 


Asn 


xxe 


Pro 


Trp 


VaX 


Leu 


Asp 


Ser 


Ala 


He 






275 










280 










285 








nj. y 


jrix y 

290 


Arg 


Phe 


Glu 


Lys 


Arg 
295 


Tin 

xxe 


Tyr 


Ti- 
ne 


Pro 


Leu 
300 


Pro 


Glu 


Pro 


His 


7\ "I -a 
/-i J. d 


Arg 


ax a 


Ala 


Met 


rile 


Jjys 


Leu 


HIS 


Leu 


Gly 


Thr 


Thi" 


Gin 


Asn 


Ser 


—> W J 










*3 1 n 










olb 










320 


Leu 


Thr 


Pin 

IjXU 


Ala 


Asp 
325 


rne 


Arg 


bill 


Leu 


Gly 
*3 *a a 


Arg 


Lys 


Thr 


Asp 


Gly 
335 


Tyr 




r* i \t 
biy 


a j. a 


Asp 
340 


lie 


Ser 


lie 


xxe 


val 

3 ^ 3 


Arg 


Asp 


AX a 


Leu 


Met 
350 


Gin 


Pro 




Arg 


Lys 
^ r r 


Val 


Gin 


Ser 


Ala 


Thr 


nlS 


Phe 


Lys 


Lys 


Val 

*3 c c: 


Arg 


Gly 


Pro 


Sear 


Arg 


TV "] 

ax a 


Asp 


Pro 


Asn 


His 


Leu 


VaX 


Asp 


Asp 


Leu 


Leu 


Thr 


Pro 


Cys 




370 










375 










380 








Ser 


Pro 


Gly 


Asp 


Pro 


Gly Ala 


lie 


Glu 


Met 


Thr 


Trp 


Met 


Asp 


Val 


Pro 


385 










390 










395 










400 


Gly 


Asp 


Lys 


Leu 


Leu 


Glu 


Pro 


Val 


Val 


Ser 


Met 


Ser 


Asp 


Met 


Leu 


Arg 










405 










410 










415 


Ser 


Leu 


Ser 


Asn 


Thr 


Lys 


Pro 


Thr 


Val 


Asn 


Glu 


His 


Asp 


Leu 


Leu 


Lys 








420 










425 










430 




Leu 


Lys 


Lys 
435 


Phe 


Thr 


Glu 


Asp 


Phe 
440 


Gly 


Gin 


Glu 


Gly 











<210> 10 
<211> 17315 
<212> DNA 

<213> Artificial Sequence 



<220> 

i<223> Description of Artificial Sequence : /note = 
synthetic construct 



<400> 10 

cctcccaggt 

ccaccaccat 

gecaggctgg 

ctgggattat 

gtgaagagag 

tttttttttt 

tggtcaegge 

aeggagtage 

tagcgatggg 

gcctgcctcg 

acttggatgt 

gatcacttgc 



teaagegatt 
ggccggcgaa 
tcttgaactc 
aggegtgage 
attaaggtgt 
tttttggagg 
tcactacagc 
tgggactaca 
ggttcacctg 
gcttcccaaa 
tttttgeect 
ctctggtgcg 



ctcctgtctc 
tattttgtat 
ctgacctcaa 
caccgcgccc 
gecaattttt 
cagggtcttg 
ctcgacctcc 
ggtgtgcgcc 
ttgecgagge 
gtgttgggat 
ccattttttt 
tttttttgtt 



agcctcccaa 
ttttagtaga 
gtgatccacc 
ggcctctcct 
tggttttttt 
cttcatttcc 
caggctccag 
accatgccca 
tgttctcgaa 
tacaggcatg 
tctctcacag 
tgtttgtttt 



gtagctggga 
gacagggttt 
tgcctcggcc 
ttgtatttaa 
tttttttttt 
cagggtggag 
caattctccc 
gatacgtttt 
ttcctcggcc 
agcccccata 
gatagacttg 
gttttgtttc 



ttacaggtgc 
caccatgttg 
tcccaaagta 
ttagcatttt 
tttttttttt 
tacaatggee 
aactcagctt 
ttattttttg 
caagcaatcg 
cccagcctca 
tggacaaatg 
ttaatttagc 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
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gacgaggtct tactatgttg cccaagctgg tcttgatcct cggctcaagc aatcctcctg 780 

ccttggcctc ccaaagtgct ggaattacag gcgtgggcca gtacacccag ccctagaact 840 

gctttgatca ttgtatcccc agaaactagc acacaacgct gcacacagta ggtgtgcatg 900 

cctgtaatcc cagctactcg ggaggctgaa gcaggagaat cgcttgaacc cgggaggcgg 960 

aggtcgcaat gagccgagat catgccactg cactccagcc taggcgacag agcaagactc 1020 

cgtatcaatc aatcagtcaa tcaatccaag caaaacaaaa ctggctttga acagtcagcg 1080 

gcatcccacc agggattgag ccccagtgtg gggcacactc tgtttacatt agctagctca 1140 

tttattcccc acaaaagcca gacaaggcag gtactaatta tgatcgtgcg atattaaaca 1200 

tggagaaacc aacgctctga taagttcgtt ttattgtgag aataaaactg ctcaatgggc 1260 

gctgttattc caagcgaggt caggatttcc ggattctacc ccgctccgcc cctggccccg 1320 

ccccgagaat gtacgtcccg ccaggccccg ccccggcctt gtcccaccca ccccgtcccc 1380 

gccccctgtc cgtctcgccc tagtcccacc tcctcgtccc cagtcagtcc tgcctctcga 14 40 

accgccccct gccttcgccg aggcagggag aggtgcctgc agtgcgagcc gtggacgcta 1500 

gggggcagcc gctcggctcg ggcccgaccg ctctgggctg cgctcctcgc ttgccctcgg 1560 

actcggctcc cgctgcgagc ggccgccctg cccgcgcacc gcgctcagcg cccaccgccg 1620 

ggcttcccgc gccggaccca gtacctcggc tccccggggc cggaccgagg ccgcaagcag 1680 

cgccgcgggg tgtggggcgg acccaggaga tgaaatgaca acgtcaaccc tccaggtact 1740 

tcgtgcggcc cggcccgact tcggaggccg aaggcagcgg ggcctgcggg ccgcagagcc 1800 

gggcggcggg cgggtggtca ggtgggcgcc tcccagcccc ggcctcgcga ccccgctccg 18 60 

gccgccccct cgccgcttcc gcccgtgtct gtttccacgc gctcggagcc gggtggcatc 1920 

tccaccctca ggccttccgg ccctgcccgg ctggcccagc cccagggccg cttcgtgtag 1980 

ccggcgcccc ggtaccgccg tggggatcag cctcacccga gccgcacagg caccctcagg 204 0 

aagctcggcc cggggtccgg ctgggcgttc agacagcgat gctcgctccc tcctctcctg 2100 

tctcgctccc tcctctcctg tcccgggtcc tggaagtccc tcgacctcac ttcgaacaag 2160 

ccccggcccc ttccccaatc ttcgacttcc tcttctcggc cagcgaccct tcattcattc 2220 

cctagatcct gcagggtggt gctgggatgg cgccacctgg gtgaacaact gacccaccct 2280 

ctaggtgctg tcagtcctgc ggggccagaa gagatcaaca tgctcaggcc tcattccggg 234 0 

gtttgaatct cagctctgcc actcaccagc aagtgacctt gggctagttt ttttttttca 2400 

tcttcgggtc tcgagtttcc tggtctgtaa aatggggtaa tgccctctag gttgttgtga 24 60 

tgatagcttt aaattgaata attacggtaa agcattcttg tatgtctggc acatagtagg 2520 

tgtccattaa gtgacagcca acaaagtgcc tgggagagtt aggcaagact tcatggaaga 2580 

agggacatca gagttccagt agcattcaga gacctggagg tggaggaggg cagaagtagg 2 64 0 

agcagacatg gaatgagttg gggtaccagg ttaggaatta agagtccata gccagatccc 2700 

accgtaccac cagcagaagc cgcagatccc gtctgcccag gcaggttttt tatggcccgt 27 60 

gaaccgagaa tgtttttcag tttcaaacac aggtgtattt atttattttt aaattttatt 2820 

tatttatttt tttgagatgg agtctcactc tgtcacccag actggagtgc agtggtgtga 2880 

tcttggctca ctgcagcctc tgcctctcag gttcaagcga ttctcctgcc tcagcctcct 2940 

gagtagctgg gattacgggc acccgcaacc acgcccggct aatttttgta tttttagtag 3000 

agaccgggtt tcaccgtgtt ggccaggctg gtctcgaact cccgacctca gatgatctgc 3060 

ccacttcggc ctcccagagt gctgggatta caggtgtgag ccacgggtct ggcctaaaca 3120 

cagctttatt aaggtataac ttgcgtgctt accatataac tcacccgtta tatatttttg 3180 

tttttattaa tttttttttt tgagacagtc tcattctgtc acccaggctg gagtgcagtg 3240 

gcacaatctc aactcattgc ggcctccgcc tcctgggttc aagtgattct cctgcctcag 3300 

cctcccaagt agctgggatt acaggcacct cccatcacac ccagctaata tttttttgta 3360 

tttttagtag agacggggtt tcaccatgtt ggccaggctg gtttcgaact cctgacttca 3420 

agtgtgatct gcccacctcg gccgcccaaa gtgctgggat tacagatgtg agccactgtg 3480 

cccagcgcaa ctcacccatt taaagtgtat aattcagtgc ttttagtata ttcatggttg 3540 

tgtgaccatc accataatca ctttttagaa tattttcatc acctccaaaa gaaacccctc 3600 

acccacaacc cctctctctc ccagccctgg gcaaccactg ctaatctact ttttttctct 3660 

gtgggtttgc ctacagagaa acccacttta tataaacaga gtcatacaat aggtggtcct 3720 

ttgtggctag cttctttcat tatgacttaa cataatgtca taggttttta cattttttaa 3780 

atggcttgaa aaacagtatt tcatgacaca tgaaaatcct atgaaattca catttcaggg 3840 

tctataaata aagttttatt ggaacacatc cctgttcgtt tattgatata ctgtctacag 3900 

ctgcttttgt gccatcaagg tagagttgag tcgttgctac agagattatc tggctggcaa 3960 

atcctaaatt atttaccatc tggtccttta cagtaagcat ttgccagccc ccattagcac 4020 

tcttttcttt catcctcatc ttttgccata cgtgttacta gagatgcggc ggggctggca 4 080 

gtttctcacc ttaccctcct gttctgtaaa actgtttctc tccattgaaa cttggacccc 4140 

ctgtgtgtga aaaactgtat gtggctgcca gaaaagtcca gtgcactctt tttttttttt 4200 

ttttttttga gatggactct tgctctggtg tgatctcggt tctctgcagc ctccacctcc 4260 
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tgggttcaag caattctcct gcctcagcct cccaagtagc cgtaactaca ggcacgcgtc 4320 

accacacccg gctaactttt tgtattttta gtagagatgg tgcttcactg tgttagccag 4380 

gatggtctca atctcctgac ctccggtgat ccgcccgcct ccgcgtccca aagtgctggg 4440 

attacaggca tgagccaccg tgcccagccc cagcacactc ttaggtgtat tactagaatt 4500 

tccaaaagga gggaggggat gggtcagtgc tgctccttat gggacaaaga ataccttcag 4560 

tattgttcta aggatagacc caagcatgaa gctacagaga gtgaaaaagt ccaaggtact 4620 

gggggtgtgt gttctggaga cgagaagcca ccagggagag gagcgctgcc ttcctgtctc 4 680 

atctgaagga acatcatgca gaagagggca gacaagggca gcctgtgcct aagctcattt 4740 

ccaagaaggt ttggggctcc cagccattca ttgtaggtag tagagggagc agaggcttga 4800 

aaataatggc cagcactcac ccctcttctc atctcttcaa tccgctgagt tcatttcaca 4860 

tcagcctggc ccagcctccc ctggcctgtc tgttttgtcg agctgtctgt gtgtgtttct 4920 

ttcggagact tcttccttgc agataagaga gggctgaaca aatgccccgg ccctgccaag 4 980 

agccggtcag gaggtttatt gtttgctgtg cctggaaccg tgggtctgcg gtgggactgc 5040 

ctgcctggta actggtgccg gatctgagga ggagaattct ggtcatgctt cgtgcaggcg 5100 

ttggtgaagc tgcggttttt aaaaccaggc tttaaaacca tgtaggtgct ttactgaccg 5160 

gtcgcggtgg ctcatttctg taatcccagc actttggaag ccaaggcggg tggatcactt 5220 

gatgtcagga gttcgagacc agcctggcca acatggtgaa actctgtctg cactaaaaat 5280 

acaaaaatta gccggatgta gtggtgggca tctatcactt gaacccagga ggtggaggtt 5340 

gcagtgagcc gagatcgcgc cagtgcactc cggcctagat gatagagtaa gactctgtct 5400 

caaaaaataa taataataaa taaaaccacg taggtgcttt cagttcagcc taaggcactg 5460 

gcctacagga tttatgagat tcccaaagac actcattctc acagagctct tagaaggttc 5520 

cagagggaca cagttctctt ctgttggcag cagggagcaa gcaagtactt ttcttactgg 5580 

agctactggt atgtggagca ttgtgtgtaa gcgtcccttc ccagggtgtc actttagaca 5640 

gcgaaggaaa ctgtcttctc tctgcagcct ggagaagcaa gctgcatgga gccaggagga 5700 

ggaggagaga ggcctttcag tgtatcctgt gcaaacttgc agtccgggca ggactttcct 5760 

tctggctggc tcttctctgg ccctgaggcc gggtgcaatt tctggtctcc ttgctaaaca 5820 

aaccagcagt cacttgctcc cgcgtgtccc ttctgccagg ctctcagctc gggtctgttt 5880 

ctctttccct tgaggcagca gaatagaaag tcccatgaac ttgccccttg agccctgacc 5940 

tgagatatta ggtcaggttt ccacttgctg aaatccagag cgtggctttt aaccgtctga 6000 

gcttattgtg aagatggccg gctggttttc aaagcctcct ttgaaggcct ttctagccga 6060 

tgttgaaaaa gtaggccaat gaaatggccc gcaagcaata aatccacagg cccagcggca 6120 

tccctcgtca cgtttcatcc ccatgcctgt gaccccaggg actttccacc tctccgagga 6180 

agcactgagc catttgcttt gcctttccag aaagccattg atctggtgac gaaagccaca 6240 

gaggaggaca aagccaagaa ctacgaggag gcgctgcggc tgtaccagca tgcggtggag 6300 

tacttcctcc acgctatcaa gtgtgagtca cacgaggggt cctcaggctg ccgcactcca 6360 

gaggggagcg gaggagaggg ggtggcgcac gagcctcaca ggggcccctc tgtgttccct 6420 

ttcacagatg aggcccacag cgacaaggcc aaggagagca ttcgagccaa gtgcgtgcag 6480 

tacctagacc gggccgagaa gctgaaggat tatttacgaa gcaaagagaa acacggcaag 65'4 0 

aagccagtca aagagaacca gagtgagggc aaggggtgag tgtctgcagc gtccgcccag 6600 

gaacctgggc cctcctcacg gctccctatc attcctggcg ctcatgctgc cttggacttc 6660 

cctgtgggaa tggtcctcat agtgcaggga tgacagtgcc agcagctgct ttcagggagc 6720 

atggcctgga gcacttccat gttcaccgtc tcattggatc tgacccacaa gccatgggga 6780 

gagggggagc ttggtgcaga tggtagagcc tctcaggacg ggaccccagc tctgtcacct 684 0 

cccagcctag tgccaggacc ccgtgtgcct accatctcca catcttggaa gtggggaggt 6900 

cgtggaactc acctccgagc gttgccacag ggagcagtga gatgacgtgg gcagtgccct 6960 

tggagcttgg cacacgagag catgcaatga gtgccagctg ctgtgactgt caaagagtca 7020 

cctgcctcca acaggtgggg cggccggctc gggttaggag ccctgccgag gtggcacagg 7080 

atggaacttg gacccccaaa gcagggcggg tggccccgat tccccagcca gggtcctctc 714 0 

cacagcctct cacctcaggt actgggttcg cccaccacgg tgatcccagt catatgaaca 7200 

caaatgctgg tggtatctcg gaggcgtggg aggggaagtg agggggtccc ccaggcagga 7260 

gaggggtggg tggtgtcagc tgcctcacca gaggcgaggc ctcctcctgg atggagggtc 7320 

tgcaccacgt tcccatccgc agagggagag gtctggcgag gcaggggtgg aggccaggga 7380 

tctccaataa ggattaaatc aagttttgag gatgacagat tccatccagg gaggaagtag 7440 

ccctggggtg acagtggctg cgttctcccc aggagttccc tttcgatttc ggttttgctt 7500 

tctcctctgt ccttcggcag tggctgcttt ggagtcaggg gcacatgctg tccctgagtc 7560 

tctacaaaaa gcttctggtg cggggcatgg tggctcacac ctgtaatccc agcactttgg 7620 

gaggccaacg cgggcagatc acttgaggtc aggatttcaa aaccagcctg gccaacatgg 7680 

tgaatcccgt ctctattaaa aatacaaaaa aataggccgg gcgcggtggc tcacgcctgt 7740 

aatcccagca ctttgggaag ccgaggcggg cggatcacga ggtcaggaga ttgagaccat 7800 
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cctggctaac acggtgaaac cccgtctcca ctaaaaatac aaaatattag ccgggcgtgg 7 8 60 

tggcgggcac ctgtagtccc agctactcag gaggctgagg caggagaatg gcgtgaaccc 7 920 

gggaggcgga gcttgcagtg agccgagatc gagccactac actccagcct gggcgactga 7980 

gtgcgactct gtcttaaaaa aaaaattagc caggcattgt ggcatgtgcc tgtaatccca 8040 

gctactcggg aggctgacgc cagagaattc agagaattac ttgaacccgg gaggcggaga 8100 

ttgcagtgag ctgagatgga gattgtgcca tcgcactcca acctgggcac catctcaaaa 8160 

aaaaaaaaaa aaaaaaaaga agcttctggc agctgcattg tttgactggc tttttttttt 8220 

tttttccccc agacagggct tcactgtctc ccaggctgga gtgcagtggt gcgatcttgg 8280 

ttaactgcaa cctctgcctc ccaagtagct ggcaccatag gtgcgtgcca ccatgcttgg 8340 

ctaattttgt gttttttgta gagacaagat ctcgctgttt cccaggctgg tctcaaactc 8400 

ctgggctcaa gcgatcctcc cgcctcagcc tcccagagtg ctgggattac aggggtgagc 8 4 60 

caccgcaccc agcctgggct gtgagagcca ctgcaggtgg cctcccagag tatgaggtgt 8520 

gagtcaccgc acctggcctg ggctgctctt taagtccagc tgaagcacat ttcccttacg 8580 

gccttcagga gagggccgac ccggattccc atctgctccg cgaagaaaca gggcgggtta 8640 

taggagtgtt tgctgggggg gtcctgaagt gagctgctga gagcattggc ccaaatgtcc 8700 

ctgaagtact tgcttgagtc acacggcact gttaagccag gaaggcttcg ttccttaacc 87 60 

agtgtgcagg cagcggagcc tggacttgct gggagcacct ctgtgctcca ggctgaaggg 8820 

ccagcttgtg actttccgtc tcccttccca gcagtgacag tgacagtgaa ggggataatc 8 8 80 

cggagaaaaa gaaactgcaa gaacagctga tgggtaagtg gctcgcggcc ctgtggcagc 8940 

ggcaggggat gagagtgggt ccgctgctgg gttggctcat gccccttggc cgggcccggg 9000 

cccgggccgg cctccctctc gcaggtgccg tcgtgatgga gaagcccaac atacggtgga 9060 

acgacgtggc cgggctggag ggggccaagg aggccctcaa agaagctgtc attttgccaa 9120 

tcaaattccc acacttgttc acaggtgaga gttgagccat ttcaaacccc aatgtaaaaa 9180 

ttccaggctt ccgcagggct ggcactccga ggcacccggg agtcagtggc gactoagccc 9240 

ccggggcctg cagagggcca ggcctggctg ctcgctggtg tccacaacac tgctgtggct 9300 

ggagctatga gatgacgatc acgtcacagg gtgggggcag gatcaggaag cagtgccagc 9360 

cggtgtcact gtgcaaggtc atggggtcat agcaggggca ctgtgcaagg tcatggggtc 94 20 

atagcacagg gcacagcagg gaatgtagct cccatcactt gtttgccagt agagtccgtt 94 80 

gtttcacaga atgccctgtt ttactgtatg tatgggactc gagaccctct gcttttccta 954 0 

ttcctttccc cagtcaggag gcctaacttc tgtggttctc tgttgcaggc aagcgcaccc 9600 

cctggcgggg gattctgctg ttcggacccc ctggcacagg gaaatcctac ctggccaaag 9660 

ccgtggcaac agaggccaac aactccacct tcttctctgt gtcctcctca gatctgatgt 9720 

ccaagtggct gggggagagt gaaaagtaag tcggccacca ggccatgctc tgccagcagc 97 80 

cccggcactg ctagtgaccc agcggcccac cctgtggagt cacctctcag aggagtggtt 9840 

tggttttcag ggagggtacc caagagcagt gtgtcgctag cttatcggct gggacacagt 9900 

cccaacaaag acaagtgaag gtggaaggga cctgggagag gtgaagtgct ccacaagaga 9960 

agggaaagga agtgaggata tttttggggt ctgggttgcg gtccctgttg gggcagtgtg 10020 

tgggggacat ggttccccac actgactcct gttttgcgtg tgatgagacg ggctcggaag 10080 

aaggggactg acgtgttgta agtgcagagc tagcgcatgg ggctcctggg attgaggctc 10140 

tgcgtttggg gctgacacca aggtgtgctc ctcctcttgg tgcagtgtgg cccgagggct 10200 

cctcaccacc acgttttccg caattccggc atgaccgtgg cctgcgcctc cctgtgggaa 10260 

gggtgagaag agggaagtgc cgggagccca ggcgggcacg gacgtgaacg tcttgtcctc 10320 

accccctttc tcaccttcac aggctggtca agaacctgtt tgagctggcc aggcagcaca 10380 

agccctccat catcttcatc gatgaggtgg attccctctg cgggtcccga aatgaaaatg 104 4 0 

agagtgaggc cgcccggagg atcaaaacgg agttcttggt ccagatgcag ggtgtgtgcc 10500 

gggcccaggg cccccttggt tcttgttgca cctgaagcca accctgggct ttattctgag 10560 

ctgcggctgg atttggttgt tctcagcctc ggagaggcac tccccagctc cagcccctca 10620 

gggcacgggt ggacttcaat tcccaagagg tgcctgaggc ctctgcctca cgggccactc 10680 

cacccctccc atggcaggca gtgccatagg tctcacctgg cacagccaga ccaagatgtg 10740 

gtttaatctc acttgggacc ctgccagtgg tgggtggcac agggatggct tcaattgctg 10800 

acacacaaag cccccggggt ctgtccccag gtttcaactg acccgtgcag gtgtccagag 108 60 

tctgagtctt tgtctccctt tctccacagg ggtggggaat aacaatgatg ggactctggt 10920 

tcttggagcc acaaacatcc catgggtgtt ggattcggcc atcaggagga ggtgagtctt 10980 

ccccaggaga agccagggct ggaggcttcc tcccaccatg gtcacggtcc gcctgctgct 1104 0 

ggcagcccgg gtgcagcctg gccccttttc cctggagtct tcccgtctgc ctgccaagca 11100 

gagccctttg tgaggctggc ttgttgagga tgtgccgcca gcatcactgg cccatgaaat 11160 

gcgtccgttt cactcaaatc ttcgtgcctc cccttccgtg aataccattc catcatccgc 11220 

tgtcaactcc tgccgtgtgc tgggctctct aggtttgaaa aacgaattta tatccccttg 11280 

ccggaggaag ctgcccgcgc ccagatgttc cggttgcatc tcgggagcac tccccacaac 1134 0 
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ctcacggatg 
atcagcatca 
cacttcaaaa 
atgttcggtt 
tctctgaggc 
tccctgcagc 
actttactgt 
gtggacacca 
attcattcag 
tcctaagtgc 
ggtggggaga 
aggagagagg 
caggaggtga 
cagcggactt 
catacccgag 
gaggcctcag 
aatgaagaaa 
tcacgagaat 
tcccacaaca 
gcacagctaa 
ttaatgagga 
gagtcgctcg 
acccacagag 
actgaggggc 
ccaaccccag 
ccatggagat 
tggtaagtga 
tgggtccaga 
cagcactttg 
ccaatatgga 
gcacttgtag 
cagaggttgc 
ctccgtctca 
tgtagctcag 
agcattttgt 
cttgacctca 
caccatcatg 
aggccagtct 
ggattacaag 
aaagtcaggt 
cagtgaatgt 
gcccgtgcca 
aaggctcctg 
gcgggcagat 
gtctctgcta 
atttgggagg 
gagatcatgc 
aaaaaaagaa 
caggaggagg 
caagcaggcc 
tgtgtgctga 
ccttcacagt 
gacctcctga 
cacttgggca 
aacagccaga 
gccgtctgcc 
ggggcacaca 
ttctgcctcc 
ttcatcttat 



caaacatcca 
tcgtgcggga 
aggtgagtgc 
ttttttttcc 
ctcctggaga 
tcttctggag 
cttaaccctg 
aatcagtgtt 
cagatctctg 
tgggaagata 
cagtcaacac 
tgtctcgggc 
aggtgtgagg 
cagttggaaa 
actgggaaga 
aatcatggta 
caaaagtgga 
agcacgggaa 
tgtgggaatt 
accatatcac 
agacatcaag 
gaccacccag 
cattctcttt 
agctgcccca 
catgatgatt 
gacttggatg 
cttgacaggg 
ataaaaacgg 
ggaggccagg 
gaaaccctgt 
tcccagctac 
agtgagcaga 
aaaataaaaa 
ggaagctggc 
acatacagag 
cgagctcaca 
cccagct aat 
gaaactgctg 
catgagctga 
ccagataata 
gtcaagactg 
tcctggcaag 
gccgggcgcc 
cgcctgaggt 
aaaatacaaa 
ctgaggcagg 
cactgcactc 
agcaagttcg 
cccatctgag 
tctggggtgg 
ggggaggttg 
cggacatgct 
aagtgaagaa 
atggtgaagg 
cagggctcca 
agggagccag 
gtggacactg 
cccccttttt 
ttataaagat 



cgagctggcc 
ctctctcatg 
ccgcggccac 
cagctcctgg 
gccgagggcc 
tgtggccatc 
ctgcggcctc 
tggaaagtgt 
ctgagtattt 
tgagtgacca 
agtcagtgag 
atggagggaa 
gaatgagcct 
accactttgt 
aaaagaggtt 
ggagaccaaa 
aacccctgat 
agaccagccc 
ctgggagata 
acttttaatg 
gcattggggc 
cccgagttct 
gggacttcct 
gtcagatcca 
gatgacctcc 
gatgtccctg 
gagggaagag 
tcttctccag 
gtgggcagat 
ctctactaaa 
tcaggaggct 
gattgggcca 
atagtcttct 
attacactca 
aaaggctaga 
caatcttctg 
ttttcctttt 
ggctcaagtg 
gccaccacac 
tgacgtcaat 
tgacgcagtt 
ggtctgccct 
gtggctcccg 
caggagttca 
aatcagccat 
agagtcactt 
cagcctgggt 
taaggggagt 
ctgcctcgag 
ccttgaaggc 
gggacctgga 
gcggtctctg 
attctcagag 
tgggaggttg 
gggcttgtcc 
aaggaagggc 
ctcttcctac 
tttccatctt 
aaaatcacct 



cggaagacgg 
cagcccgtga 
tgctgagaaa 
tcctgctccc 
agtgccatgg 
tcagggtgtg 
ctccgtcagc 
tggatataaa 
gctgtgggtc 
acagacaaaa 
tgtctcagag 
gggcagatgg 
tgtggacatt 
gttaatctat 
taattggcct 
ggcacttctt 
aaacccatca 
ccatgattca 
gatacagttc 
tcactgtatt 
tacagattgc 
gaaggagccc 
tagagaccgg 
tttggtttca 
tgactccatg 
gggacaaact 
ggctgcacag 
gccaggcatg 
cacaaggtta 
aatacaaaaa 
gaggcagaag 
ctgcactcca 
tctatgtcct 
ctgggggccc 
tgcagtggtg 
acctcagcct 
ttttttagaa 
atcctctcag 
ctagcctgca 
ccatgggttt 
gcaaaggcag 
tgtggctggt 
cctgtaatcc 
agagcagcct 
gtgtggtggc 
gaaccgagga 
gacagactcc 
tatgagatga 
agggaagagg 
aaacctcttc 
gcctggctcc 
gccaccaccc 
gactttgggc 
attggggcaa 
cagtcaatac 
cttgcagcca 
ttcctcctct 
ttgttcccct 
ggaagtgtca 



aaggctactc 
ggaaggtgca 
aatctcatag 
cggctgctca 
gggctgcacc 
tgaaagcaga 
actgtgtgct 
atgaccagga 
ctcctgtgtg 
atcctagacc 
gatggagatg 
cactggggtg 
gggggaatgt 
tttcatgctg 
tacagttcca 
acatgctggt 
gatctcgtga 
gttacctccc 
aagttgaaat 
cctggtgaga 
cagagtccca 
gtcctcct ca 
aggggtcgag 
ggtctgtggc 
ctcaccaggg 
cttagagcct 
agcccagaaa 
gtagctcacg 
ggagttcgag 
ttagccgggt 
gatcgcttga 
gcctgggtga 
gggctgttga 
agaatgcagg 
cagtgatagc 
cctgagtagc 
acggcctcac 
cgtggcctcc 
ttttatattc 
ggctttgcgg 
cgcctcacct 
gttgggggtg 
cagcacttcg 
ggccaacata 
acgtgcctgt 
ggtggaggtt 
gtctccaaaa 
tggagcttag 
gtccctgcca 
ccaccctcag 
tgctcaggca 
ggcccacggt 
aagagagtta 
atccaggcac 
agagttccct 
cagagacact 
cctggatgct 
aaattaatgc 
aggagtgggg 



gggcgcggac 


11400 


gtcggccaca 


11460 


taagagcggg 


11520 


ggtgacacag 


11580 


cactgtcccc 


11640 


ggacagggcc 


11700 


cttgtgcggg 


11760 


agacctgtct 


11820 


cccggccctg 


11880 


ttccgttgtt 


11940 


gagcagggtt 


12000 


atgcctgaga 


12060 


tttgccaagg 


12120 


ctgataaaga 


12180 


catggctagg 


12240 


ggcaagagaa 


12300 


gacttcacca 


12360 


cctgggtccc 


12420 


gtgggtgggg 


, 12480 


cgtgacagtg 


12540 


tgcaaatcat 


12600 


gatgctaggg 


12660 


cccctctgac 


12720 


ccctctcgca 


12780 


gacccaggag 


12840 


gtggtttgca 


12900 


attgagggtt 


12960 


cctgtaatcc 


13020 


accagcctag 


13080 


gtggtggtgt 

zj zj zj zj zj zj 


13140 


aqccggqagq 

ZJ ZJZJZJ ZJZJ 


13200 


cagagtgaga 


13260 


gaatggcctt 


13320 


tggcagcatt 


13380 


tcactgcagg 


13440 


aacaggcaca 


13500 


agtgttgccc 


13560 


caaagtgctg 


13620 


tttagcaagt 


13680 


tqcagtqagq 

^ Zj ZD " -3 Z3 


13740 


gcatagcctg 


13800 


tgcagctaag 


13860 


ggaggccgag 


13920 


gtgaaac'ccc 


13980 


agtcccagct 


14040 


gcggtgagcc 


14100 


aaaaaaaaaa 


14160 


attctcagac 


14220 


taggcagatt 


14280 


ctgcgcctgt 


14340 


catactgttg 


14400 


gaatgcagac 


14460 


aaagctgctt 


14520 


tccccatgtc 


14580 


ctgctgtctg 


14640 


ccactgccct 


14700 


catcagctcc 


14760 


tgcttggatt 


14820 


cggggtggcg 


14880 
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ggggagaagc agccgtgctg ccaggtcacc cagacctcca gacagccggc tagccccact 14940 

gcccgttcct tttacgccca agttttgctc cttgagagca gattggctga tgcccctgca 15000 

accccagccc aagctctgcc tcaaagaccg agtgacataa gccattccca ccctcctagg 15060 

ttcacatcca gggctgtgtc ttccttgggg gaggagatgg tgtcgtttag atcagggtaa 15120 

ggcagtcagg cgggtgttca ccactgcctt ttcttcctct gagcgtgaga acactgaacc 15180 

cagccactgc ccctgggtcc ctgtcctgga aatggtctaa taaatccttt tcccttcttg 15240 

agctacccaa gtgatctcat ctttccttga ctcttaactg agactctgaa gggccaccct 15300 

tgcttccaat taaaagaccc tctggttttc tgtctcctct cttttctaaa gcagagcagg 15360 

aaacagaaca ctgcagttct taggcctgtt ctccagtcag gtgtgcaagg cctcctcccc 15420 

ttaaggcctc agaagtttgg ctggggatgt ttgtgggatc cagacagttc ttgccgttgt 15480 

tcggcctaca gatcagagac tgcaaagtgg gagccctgcg taccagaccc gtcctgcagc 15540 

tgtgtttgtt tggtttggtc tgccgctaac atttaaaagt cgagagttgc tgggcgcggt 15600 

ggctcacgcc tgtaatccca gcactttggg ccgctaacat ttaaaagtcg agagttgctg 15660 

ggcgcggtgg ctcacgcctg taatcccagc actttgggag gccgacgcgg gcggatcaca 15720 

aggtcaggaa atcgagaaca tcctggctaa cacagtgaaa ccccgtctca ctaaaaataa 15780 

aaaaattagc cgggcgtggt ggtgggcgcc tgcagtccca gctacttggg aggctgaggc 15840 

aggagaatgg cgtgaacccg ggggacggag cttgcagtga gccgagatcg cgccactgca 15900 

ctccagcctg ggcgacagag caagactctg tctcaaaaaa aaaaaaaaag tcgagagttg 15960 

acataaaaat tcagaattct gacttctaga agattaaatt aggtaatgtt gggcctgcgt 16020 

cctcatgggg cagcctgtca gcggccacta agtagaggct gctctgtaca gaaggggtcc 16080 

cccatcccca ctcatctgtg ttgcctgccc agatagcccc atggcc'attt cagtttgcga 16140 

cccccaacag accaaggacc cagagctggc agccctggac ccacgtccgg caggcagtga 16200 

agctgcagag gctgtcccac gtttacccca tgcatataaa cagaccaaag tcaaagcaca 16260 

ttcacacaga aagacctggc ctagtaaatg aagcttatag cctgcaaagg gctttgttcc 16320 

aaccttccct ctccaaaacc cagtactgct cctctacctt tcaaacctgg ttgggccaga' 16380 

actgattaca tggccaactc ttggctcaca ctgggcttgc actgggccat ttctttgctc 16440 

tggttcagag cctctgcaga agtgagctaa ctttggatca gtagctccaa cacttggctc 16500 

cagtgctggc aggttacccc tcaaccatag cagctgggat ctttgtggct tgggggtgct 16560 

tgtgtaagtt cttggttatg tgccctcagc caccaggata tgaaccgtgt ctgcagagct 16620 

tcactttagt gagcttacaa gtttttaaat ttttacttct agttacattt acttgaatca 16680 

gaacgttgtt tcctcattcc tactgcttaa acaccttgac aagtcctagg ggttaacaaa 1674 0 

ggttagcatg gctatggtcc atccctgtgc tctagttaga gcgtgaagac acctgacttt 16800 

ccagttgtct ctctccatga ccagcaacag gaaccactga cgctgaactt tggacagtgg 16860 

cctcagactc tggctgccag cacacaacct gccatcatcg atgttaaaca tgctgacatg 16920 

tgcagaggag tttcctccct gaaatgctct gaaattcact tctctgcctg gggattctgt 16980 

tataaacctt ctgcctacat tggctttcac tgtggaagtt gatttctaaa actctgatga 17 040 

gctcacaacg ttgacatgta tatgcgtttt ttgtgagtgc ttcctgctca aagtgggaga 17100 

gattttactg gaaatgcaat aaagtttgca ttttattgct accgatgccc accacaaaag 17160 

cgttaacctc agaaagctat aagaagcatc aaaagattgc catagccccc tgtaagagct 17220 

tctgaacatt gtccaaatga aaggaggctc agctgaattc taattttcct aaagacccat 17280 

gtttttagca tctatgagct ttctacaaat ctccc 17315 



<210> 11 
<211> 437 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : /note « 
synthetic construct 

<400> 11 

Met Thr Thr Ser Thr Leu Gin Lys Ala lie Asp Leu Val Thr Lys Ala 

15 10 15 

Thr Glu Glu Asp Lys Ala Lys Asn Tyr Glu Glu Ala Leu Arg Leu Tyr 
20 25 30 
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Gin His Ala Val Glu Tyr Phe Leu His Ala He Lys Tyr Glu Ala His 

35 40 45 

Ser Asp Lys Ala Lys Glu Ser He Arg Ala Lys Cys Val Gin Tyr Leu 

50 55 60 

Asp Arg Ala Glu Lys Leu Lys Asp Tyr Leu Arg Ser Lys Glu Lys His 
65 70 75 80 

Gly Lys Lys Pro Val Lys Glu Asn Gin Ser Glu Gly Lys Gly Ser Asp 

8 5 90 " 95 

Ser Asp Ser Glu Gly Asp Asn Pro Glu Lys Lys Lys Leu Gin Glu Gin 

100 105 ' 110 

Leu Met Gly Ala Val Val Met Glu Lys Pro Asn He Arg Trp Asn Asp 

115 120 125 

Val Ala Gly Leu Glu Gly Ala Lys Glu Ala Leu Lys Glu Ala Val He 

130 135 140 

Leu Pro He Lys Phe Pro His Leu Phe Thr Gly Lys Arg Thr Pro Trp 
14 5 150 155 ~ 160 

Arg Gly He Leu Leu Phe Gly Pro Pro Gly Thr Gly Lys Ser Tyr Leu 

165 170 ' 175 

Ala Lys Ala Val Ala Thr Glu Ala Asn Asn Ser Thr Phe Phe Ser Val 

180 185 190 

Ser Ser Ser Asp Leu Met Ser Lys Trp Leu Gly Glu Ser Glu Lys Leu 

195 200 " 205 

Val Lys Asn Leu Phe Glu Leu Ala Arg Gin His Lys Pro Ser He He 

210 215 220 

Phe He Asp Glu Val Asp Ser Leu Cys Gly Ser Arg Asn Glu Asn Glu 
225 230 ' 235 240 

Ser Glu Ala Ala Arg Arg He Lys Thr Glu Phe Leu Val Gin Met Gin 
245 250 255 

. Gly Val ' Gly Asn Asn Asn Asp Gly Thr Leu Val Leu Gly Ala Thr Asn 
" 260 265 270 

He Pro Trp Val Leu Asp Ser Ala He Arg Arg Arg Phe Glu Lys Arg 

275 280 285 

lie Tyr lie Pro Leu Pro Glu Glu Ala Ala Arg Ala Gin Met Phe Arg 

290 ' 295 300 

Leu His Leu Gly Ser Thr Pro His Asn Leu Thr Asp Ala Asn He His 
305 310 315 320 

Glu Leu Ala Arg Lys Thr Glu Gly Tyr Ser Gly Ala Asp lie Ser He 

325 330 • 335 

He Val Arg Asp Ser Leu Met Gin Pro Val Arg Lys Val Gin Ser Ala 

340 345 350 

Thr His Phe Lys Lys Val Cys Gly Pro Ser Arg Thr Asn Pro Ser Met 

355 360 365 

Met He Asp Asp Leu Leu Thr Pro Cys Ser Pro Gly Asp Pro Gly Ala 

370 375 380 

Met Glu Met Thr Trp Met Asp Val Pro Gly Asp Lys Leu Leu Glu Pro 
385 390 ~ 395 400 

Val Val Cys Met Ser Asp Met Leu Arg Ser Leu Ala Thr Thr Arg Pro 

405 410 415 

Thr Val Asn Ala Asp Asp Leu Leu Lys Val Lys Lys Phe Ser Glu Asp 

420 425 " 430 

Phe Gly Gin Glu Ser 
435 
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